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FOREWORD 


Compiled in this volume are twelve papers from agencies working with the 
Guidance Laboratory of NASA-ERC. These papers concern special studies in the 
disciplines of guidance theory, trajectory analysis, and celestial mechanics. They 
include: 


1. A presentation of a method for classifying and analyzing guidance 
modes, together with a survey and classification of existing modes; 

2. A development of a minimum fuel guidance solution for certain 
hyperbola-circle transfers; 

3. A presentation of generalized necessary conditions for a minimizing 
control function; 

4. A demonstration of how many non-standard problems in control 
theory can be transformed to "ordinary differential form" and 
treated in a standard way; 

5. A study of impulsive transfer optimization, presenting a computational 
scheme for determining optimum n-impulse trajectories; 

6. A demonstration of the utility of a group theoretical principle in 
solving non-homogeneous linear systems of differential equations 
and an application to the perturbation of elliptic motion; 

7. A derivation of a useful form of the Sundman inequality; 

8. A study of a formal approximate decoupling of the n-body problem 
into a k-body problem (k < n) and an (n - k + 1) - body problem; 

9. A presentation of theory for the solution of the perturbation problem 
for a system of non-linear differential equations under general 
linear two-point boundary conditions; 

10. A presentation of an algorithm for automatic computation of 

derivatives of a composite function and its application to a power 
series solution of a celestial mechanics problem; 


iii 



11. A presentation of results on the long-term behavior of close lunar 
orbiters; 

12. An analytical study of the three-dimensional stability of motion of a 
particle near in a non-linear Earth -moon gravitational field. 

The first paper provides a framework enabling many specific studies to 
result in an accumulating body of guidance mode information, useful to both the 
guidance theorist in research work, and the guidance software engineer in planning 
and development work. 

The second paper provides one guidance solution of interest to the guidance 
theorist, and could form the basis for a guidance mode to be studied from the 
viewpoint of the first paper. 

The third paper provides the trajectory analyst with tools for attacking a 
broader class of optimization problems, and the fourth paper points out a way he 
may profitably apply a large body of results to his problems which are not of the 
"ordinary differential" type. 

The fifth paper aids in the search for optimum impulsive orbit transfers, 
and the sixth through the ninth papers contribute celestial mechanics theory useful 
toward the solution of problems occurring in mission design, orbit determination or 
orbit prediction. 

The tenth paper supports both automatic symbolic processing efforts and 
computational requirements in trajectory analysis and celestial mechanics. 

The last two papers aid the trajectory analyst or mission designer interested 
in stable orbits near the moon or its equilateral libration points. 
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SUMMARY 


This volume contains technical papers on NASA- 
sponsored studies in the areas of trajectory analysis 
and guidance theory. These papers cover the period 
beginning 1 October 1966 and ending 1 October 1967. 

The technical supervision of this work is under the 
personnel of the Guidance Laboratory at NASA-ERC. 
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INTRODUCTION 


This document contains twelve technical papers covering work sponsored by 
the Guidance Laboratory of the NASA Electronics Research Center (ERC). The 
papers are concerned with guidance theory, trajectory analysis, and celestial 
mechanics. 

The following table presents the contributing institutions and the discipline of 
the paper. 


Institution/ 

Company 

Author 

Discipline 

TRW 

C.G. Pfeiffer 

Guidance Theory 

AMA 

T.N. Edelbaum 

Guidance Theory 

Northeastern Univ. 

J. Warga 

* 

Trajectory Analysis 

Princeton Univ. 

P.M. Lion 

Trajectory Analysis 

CRA 

D. Lewis/P. Mendelson 

Celestial Mechanics 

IBM 

P.Sconzo/D. Valenzuela 

Celestial Mechanics 

Stanford Univ. 

J. Vagners/H. B. Schechter 

Celestial Mechanics 


* Two papers 

** Four papers 

Synopses of the individual papers are presented below: 

Paper No. 1 

The first paper by C.G. Pfeiffer of TRW surveys the guidance modes 
presently in use or in development, with the objective of appraising their usefullness 
when applied to a unified guidance concept. Unified guidance refers to the application 
of one hardware system as well as one overall computational scheme for guidance 
in all phases of a mission. 
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INTRODUCTION 


The breadth of the study called for, and the importance of the consequences 
drawn from it, made it advisable to review critically concepts and terms in use in 
space guidance and to update definitions where necessary. Thus this paper starts 
with defining terms such as guidance, guidance mode, unified guidance and others. 

The paper also elaborates on the relationship of guidance to navigation. 

Chapter in then gives a rather comprehensive survey of presently known 
guidance modes (USA-developed) and categorizes them according to various principles 
of division. 

For the evaluation of guidance modes in general, as well as with respect to 
the applicability to unified guidance, measures of performance are suggested in the 
following chapter. These concern such criteria as optimality, accuracy, applicability, 
flexibility, and others. 

The paper does not aim at evaluating all the listed modes, but rather tries to 
lay the groundwork for such an attempt. 

Paper No. 2 

The second paper, a technical progress report from Analytical Mechanics 
Associates, Inc., by T.N. Edelbaum, presents a minimum fuel guidance procedure. 

It presents a first-order impulsive correction procedure for mid-course guidance, 
along with a procedure for establishing approximate equations of motion which are 
then integrated. This guidance procedure is classified in the paper by C. Pfeiffer 
(No. 1 in this document). 

Paper No. 3 

The third paper, written by Jack Warga of Northeastern University, adds 
directly to the results presented by him in the first compilation in this series.* In 
the referenced paper. Prof. Warga presented existence and approximation theorems 
for minimizing controls applicable to a general class of optimization problems. The 
present paper contains a statement and proof of some corresponding conditions that 
are necessary for the solutions to be minimizing. 


* NASA, First Compilation of Papers on Trajectory Analysis and Guidance Theory, 
NASA Scientific and Technical Information Division, Wash., D.C. , 1967. 
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Paper No. 4 

In the fourth paper. Prof. Warga has presented some observations that 
appear to have been overlooked by some workers in mathematical control theory 
faced with problems that are not of the ’’ordinary differential” type. He illustrates 
with three specific examples of how one may transform the problem to one which is 
immediately treatable with classical results of control theory. 

Paper No, 5 

The fifth technical paper is concerned with trajectory analysis. Written by 
P. M. Lion of Princeton University, it presents a study of impulsive trajectory 
calculations. It is based on Lawden' s primer vector and presents necessary 
conditions for a trajectory to be optimum. This primer vector has significance 
for non-optimal trajectories. When these ideas are combined, a computational 
scheme for determining optimum n-impulse trajectories is suggested. 

The remaining papers are on various aspects of celestial mechanics. 

Paper No. 6 

The sixth technical paper, written by D. C. Lewis and Pinchas Mendelson, 
formerly of Control Research Associates and now of Zetesis Corp. , exploits a 
principle which considers a system of differential equations invariant under continuous 
and differentiable group transformation. It shows that it is possible to write down 
a number of linearly independent solutions of the variational equations equal to the 
number of independent parameters of the group. This exploitation is used to present 
several solutions to the Keplerian case. 

Paper No. 7 

The seventh technical paper, written by D. C. Lewis, and entitled, ’’Comments 
on the Sundman Inequality, ” presents some preliminary theorems in vector analysis 
and applies them in developing the Sundman inequality in the form used in former 
papers . 
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Paper No. 8 

The eighth paper, also written by D.C. Lewis, discusses partial and complete 
decoupling of a n-body problem into a k-body problem (k < n) by use of developed 
parameters of the problem. Theory is presented to develop the so-called quasi- 
first integrals of the partially decoupled system from the first integrals of the 
unreduced system. 

Paper No. 9 

The ninth paper, also written by D.C . Lewis, discusses the perturbation 
problem for a system of non-linear differential equations. The problem is reduced 
to solving k ’’bifurcation" equations where k is the degeneracy of the problem. 

A Green’ s matrix for linear systems with boundary conditions of arbitrary degeneracy 
is constructed. 

Paper No. 10 

The tenth paper, written by P. Sconzo and D. Valenzuela of IBM, presents 
work done on automatically computing the derivatives of a function by the recursive 
Schlomilch-Cesaro formulation. The general expression obtained was used to 
construct the power series expansions in the time variable of those powers of the 
radius vector which appears most frequently in celestial mechanics. 

Paper No. 11 

The eleventh paper, written by J. Vagners of the University of Washington 
(Stanford University), presents results on the long-term behavior of close lunar 
orbiters. A Hamiltonian is presented with the short and medium terms removed. 

The long-period motion is thus analyzed and results presented. 

Paper No. 12 

The last paper in this report was written by Hans B. Schechter of Stanford 
University. This paper presents an analytic study of the three-dimensional stability 
of motion of a particle near L^ in a non-linear Earth-moon force field. A linear 
solar gravitational field distribution is superimposed on the Earth-moon field. The 
long-period features of the motion of the particle are studied. 
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Two internal publications authored or co-authored by members of the 
sponsoring laboratory and in the subject technical fields have appeared since the 
last compilation. These are listed below with their abstracts. 


Miner, W. E. , and J. F. Andrus: Necessary Conditions for Optimal 
Lunar Trajectories with Discontinuous State Variables and Inter- 
mediate Point Constraints. NASA TM X-1353, April 1967. 

ABSTRACT 

The guidance regime for an optimal multi-stage lunar trajectory is derived 
by applying the mathematics of the calculus of variations as established by Denbow 
for the generalized problem of Bolza. The steering angle programs for four 
constant thrust level phases and the times to initiate and terminate the two coast 
phases are determined in order to place maximum payload into a specified lunar 
orbit. The problem considered here is to determine an optimal trajectory 
consisting of six sub-arcs utilizing three vehicle stages on which maximum payload 
is transported from an exo-atmospheric point near the Earth to a specified lunar 
orbit. The intermediate point constraints include two points at which stages are 
separated and mass discontinuities occur, an Earth parking orbit of specified 
energy and angular momentum magnitude, and four thrust magnitude levels. The 
Euler- Lagrange equations determine the optimal steering for the thrusting phases 
and the Denbow transversality equations are used to calculate the discontinuities at 
the ends of the sub-arcs. This method is applied here and the equations necessary 
to solve this problem using a high-speed computer are derived. 


Hoelker, R. F. : Numerical Studies of Transitions between the Restricted 
Problem of Three Bodies and the Problem of Two Fixed Centers and 
the Kepler Problem. NASA TM X-1465, November, 1967. 


ABSTRACT 

For the comparison of the trajectories of the two fixed-center problem with 
those of the restricted problem of three bodies, fields of trajectories are numerically 
computed for six initial position conditions, all starting on the line of masses and 
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perpendicularly to this line. The transition is studied by varying stepwise, for these 
fields, the rate of revolution of the primaries from zero to the equilibrium rate of 
circular motion. 

For the investigation of the behavior of trajectories in the transition from the 
Kepler problem to the restricted problem, the Kepler orbits are considered in a 
coordinate system rotating at the rate determined by the continuation of this system 
into that of the restricted problem. Transition characteristics are shown on samples 
of periodic orbits and of trajectory fields. In particular, the continuous transition of 
a family of Kepler orbits into periodic orbits about the smaller primary as well as 

about the L -Tibration point is demonstrated. 
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AN ANALYSIS OF GUIDANCE MODES 

By C. G. Pfeiffer 

Head, Mathematical Physics Section 
Guidance and Analysis Department, TRW Systems, 
Redondo Beach, California 

NASA Contract NAS 12-593 
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GUIDANCE MODES 


An Analysis of Guidance Modes 1 
C. G. Pfeiffer 2 


1. Introduction 


Guidance theory is concerned with the solution of a two-point boundai, 
value problem, which arises when one attempts to control 3 the translation 
motion of a space vehicle so as to attain desired end conditions at mission 
completion. That is, given the initial conditions, such as the inertial 
position and velocity of the launch site, and the desired end conditions, 
such as the orbital elements of the final satellite orbit about the moon* 
or planet, and a description of the characteristics of the rocket engines 
and/or aerodynamic lift /drag maneuvers which supply the controlling 
accelerations during the mission, the guidance analyst must design an 
algorithm for calculating the translational accelerations to be applied as 
the vehicle moves toward the final time. Then: 

Definition 1: Given a mathematical model of the motion of a space 

vehicle, a description of the translation acceleration which can be 
commanded by the guidance system, and an estimate of the state of the 
overall dynamic system, guidance is the task of calculating and executing 
a realizable acceleration profile which will cause the trajectory of the 
space vehicle to attain desired end conditions, where 

Definition 2: The state of the space vehicle system consists of the 

position and velocity of the vehicle, the parameters determining the 
vehicle performance capability, and the parameters determining the 
gravitational and atmospheric accelerations. 

The estimate of the state is obtained from the navigation system, where 

Definition 3: Navigation is the task of estimating the state of the 
space vehicle system from sensed data, such as the first and second integrals 
of on-board accelerometer data, and/or earth-based tracking data, and/or 
on-board observation of a celestial reference. 

Definition #1 states that guidance encompasses guidance theory as 
well as the mechanization of the theory. Guidance mechanization usually 
concerns the guidance theoretician only to the extent that it affects his 
analytical treatment of the problem. For example, he usually assumes that 
the attitude control problem can be ignored, where 


Acknowledgement: This paper was written under NASA contract NAS 12-593, 

administered by Electronics Research Center, Cambridge, Massachusetts. 

The classifications of guidance modes and measures of performance were 
suggested by Mr. W. E. Miner and Mr. D. H. Schmieder of the Guidance Theory 
and Trajectory Analysis Branch of ERC. 

2 Head, Mathematical Physics Section, Guidance and Analysis Department, 

TRW Systems, Redondo Beach, California. 

3 G uidance theory is a special case of final value control theory. 
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Definition 4 : Attitude Control is the task of attaining and stab- 

ilizing the vehicle in the attitude configuration called for by the 
guidance system. 

This approach is reasonable for most applications, because the guidance 
and attitude control response times are usually so different that there is 
negligible interaction.* On the other hand, the analytical treatment will 
certainly be dependent upon the functional form of the guidance accelerations, 
which might be applied in the form of impulsive changes of velocity, realized 
by thrusting with a relatively high acceleration level for a relatively 
short time; by starting, throttling, steering and/or shutting off a rocket 
engine which thrusts for a relatively long period of time, and/or by applying 
lift and/or drag accelerations during motion in the atmosphere, realized 
by commanding motion of aerodynamic surfaces. With these considerations in 
mind, the analyst seeks to design a guidance "mode", where 

Definition 5 : A guidance mode is a policy for calculating the para- 

meters and functions which will accomplish the guidance task. 

Since navigation information will be gathered during the mission in order to 
update the estimate of the state of the system, a guidance mode must be 
capable of acting as a real-time feedback final-value control law. 

In general, there exists an infinite variety of guidance modes which 
will accomplish mission objectives. Thus one seeks an optimal guidance 
policy which satisfies the end conditions while minimizing some performance 
index, such as engine propellant expenditure, or else one pre-specif ies a 
functional form which is near-optimal. Present practice is to simplify 
the overall optimization problem by treating it as a sequence of two-point 
boundary value problems. That is, the overall mission is thought of as a 
sequence of "phases", usually characterized by the means available for 
applying the guidance accelerations. The objective of the guidance system 
for any given phase is to attain an intermediate set of pre-specif ied end 
conditions. For example, a guidance phase might consist of transfer by 
means of relatively high rocket thrust acceleration from a near-earth 
circular parking orbit to a specified earth-escape hyperbola. (A more 
detailed description of guidance phases is given in Appendices A and B) . 

Such intermediate end conditions must be obtained by a "targeting" method 
(discussed in Appendix C) . The imposition of these constraints on the 
overall trajectory leads to a sub-optimal overall guidance law, but, since 
each phase can be treated individually, the design of appropriate guidance 
modes is much simplified. In practice, guidance modes for the Individual 
phases are usually quite different in form. Considering also the diverse 
forms of guidance mechanization employed for the various phases, it is true 


*Note that stability is not an important consideration in the guidance 
problem, for the duration of guidance is finite and short compared to the 
response time of the dynamic, system defined by the translational equations 
of motion. This is not true for the attitude control problem, indeed, 
stability is usually the primary design goal. 
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that essentially different guidance systems are presently used on a given 
mission. 

It seems clear that it would be desirable to design a unified 
guidance system for future applications, where 

Definition 6: A unified guidance system is one capable of guiding 

all phases of a given mission. 

Toward that end, it is the objective of this paper to provide a cursory 
survey of the present state of guidance theory (see also References 1-4), 
to describe and classify existing guidance modes, and to define some quali- 
tative measures of their relative performance. The purpose of such a study 
is to aid in the rational selection and/or development of guidance modes 
for future missions, to facilitate the synthesis of the chosen modes into 
a unified guidance concept, and to point out problem areas where further 
development is required. 

2. The Separation of Guidance and Navigation 

Implicit in the definitions of guidance and navigation is the assump- 
tion that these two problems can be treated separately. That is, in present 
practice the guidance theorist designs a guidance mode by assuming that 
the state is known perfectly, and for real-time applications employs the 
estimated state in place of its true value. This assumption, which is 
essential to a meaningful discussion of existing guidance modes, requires 
further clarification. 

Strictly speaking, the deterministic derivation of a guidance mode 
is not correct, for the predicted end conditions which determine the guid- 
ance functions become random variables if there are random estimation 
errors and random systematic disturbances to the trajectory. In effect, 
the state of the system can no longer be defined simply by position, velocity, 
and system parameter vectors. Instead, the state must be thought of as the 
expected value of these quantities plus all the statistical moments of their 
distribution. In other words, the state can only be described by the condit- 
ional probability density function of the state, given the navigation data. 
From the point of view of guidance optimization theory, the random behavior 
of the dynamic system implies that there no longer exists a field of 
solutions which are the characteristics of the deterministic Hamilton-Jacobi 
partial differential equation. Thus, conceptually at least, the notion of 
a predictable reference trajectory has to be abandoned. 

Simple examples of stochastic control problems (see Appendix D) would 
seem to indicate that the deterministic guidance analysis is not at all 
valid for realistic problems. As a practical matter, however, stochastic 
guidance analysis is not required for those applications where (1) an 
apriori reference trajectory is available, and (2) the random navigation 
errors and random systematic errors are small. That is, the deterministic 
analysis applies when the first variation (or perhaps the first and second 
variations) about some reference trajectory is the dominant consideration. 

In a first variation analysis the random errors enter linearly and the 
deterministic approach can be theoretically justified (see work by Joseph, 

Lou, and Gunkel) . Consideration of the second variation (or second and 
third variations) does not change this conclusion. It then follows that 
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results similar to the deterministic case are obtained but with additional 
terms which can be interpreted as non-linear corrections or biases which 
can be calculated. Such assumptions can be thought of as devices to sim- 
plify the difficult computational problem of computing expectations. 

The assumptions required to justify deterministic guidance analysis 
usually apply to those phases of a space mission where continuous guidance 
accelerations are applied, but stochastic considerations become important 
when the guidance is applied in the form of a sequence of small velocity 
impulses at unspecified times. The difficulty in this case is finding the 
times of application. Present practice for the Ranger-Mariner-Surveyor 
type of mission (Reference 5) is to pre-specifv these times by heuristic 
or empirical rules, and to calculate the real time corrections with a 
linearized deterministic rule. The non-linear effects of the corrections 
are treated by an iterative technique. Stochastic considerations are intro- 
duced by employing a non-linear maximum likelihood estimator in the naviga- 
tion equations, and appropriately modifying the targeted end conditions so 
as to take into account the statistics of the estimation and systematic 
errors. Small real time (adaptive) variations in the time of application 
of the corrections are sometimes allowed. This approach demonstrably works 
well for many applications. 

One concludes, then, that the separate treatment of the guidance and 
navigation problems can indeed be justified for most present day space 
missions, and that deterministic guidance analysis is valid, if appropriate 
approximations are made and if appropriate constraints are placed upon the 
guidance policy. 

3. Description and Classification of Typical Guidance Modes 

The primary purpose of this paper is to classify typical guidance 
modes, and to define qualitative measures of their performance (see Intro- 
duction). In this part, various guidance modes will be classified according 
to the mathematical approximations and/or assumptions introduced in their 
derivation (Table 1). Although many of the methods discussed have never 
been used in real time applications, they must be considered as possible 
modes for future missions. 

Class 1; Precise Model of Dynamic System - The guidance mode is based 
upon a mathematical model representing all known accelerations on the vehicle 
which are numerically significant. 

Class 1.1: Expansion of Solutions - The Class 1 guidance mode gene- 

rates the control as an explicit function of the state for all states in 
some region of applicability. 

Class 1.1.1: Linear Expansion - The control is a linear function of 

the state. Examples are: 

a) delta guidance - guide to null deviations (6) 
from a standard trajectory 

b) lamda matrix guidance (Reference 6) 

c) second variation guidance (References 7,8) 

d) impluse velocity-to-be-gaired (References 3,5) 

e) steering to velocity-to-be-gained (Reference 3) 
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Class 1.1.2: Non-Linear Expansion - The control is a non-linear 

function of the state. Examples are: 

a) dynamic programming (Reference 9) 

b) solution of Hamilton- Jacobi partial differential 
equation 

c) series expansion of equations of motion 

d) series expansion of solutions (Reference 10) 

e) series expansion of Hamiltonian (Reference 11) 

Class 1.2: Successive Approximations - The class 1 guidance mode 

generates the control function by successively applying in real time an 
algorithm which converges after some number of iterations to the optimal 
control. 

Class 1.2.1: Direct Methods - A suitable approximate ordinary 

extremum problem is defined in which only a finite number (N) of parameters 
is to be determined, and then the class 1.2 guidance mode generates the 
control by passing to the limit as N + « in the solution of the approximate 
problem. Essentially, a minimizing sequence of control functions is con- 
structed, and the desired solution is obtained by a limiting process based 
upon this sequence (Reference 12, pp 174-175). Some approaches which might 
be used are: 

a) steepest descent (References 13,14) 

b) Rayleigh-Ritz method (Reference 12, pp 175-176) 

c) method of finite differences (Reference 12, 
pp 176-177) 

Class 1,2.2: Indirect Methods - The class 1,2 guidance mode succes- 

sively approximates in real time the control which satisfies optimality 
conditions and the desired end conditions. Examples are: 

a) optimal impulsive velocity-to-be-gained (Ref- 
erence 5) 

b) second variation control (References 7,8) 

c) sweep method (Reference 15) 

d) quasilinearization (Reference 16) 

e) quasi-second order approximations (Reference 17) 

Class 2: Approximate Model of System Dynamics - The guidance mode 

is based upon an approximate mathematical model of the vehicle dynamics 
and/or the gravitational and atmospheric acceleration. Essentially, approx- 
imations are introduced into the physical model in order to simplify the 
mathematical solution of the equations of motion. 

Class 2.1: Closed Form - The Class 2 guidance mode yields a closed 

form solution for the control in terms of the given initial conditions and 
the desired end conditions. 

Class 2.1.1: Approximation of Environmental Accelerations - The 

derivation of the class 2.1 guidance mode follows from approximating the 
first and second integrals of the gravitational, drag, and/or lift acceler- 
ations between the initial and final times as relatively simple functions of 
initial and end conditions. Examples are: 

a) iterative guidance mode (Reference 18) 

b) MIT explicit guidance (Reference 19) 
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c) TRW explicit guidance (Reference 20) 

d) Lewis Research Center explicit guidance (Reference 21) 

e) Aerospace explicit guidance (Reference 22) 

f) Robbins explicit guidance (Reference 23) 

Class 2.1.2: Conic Model - The derivation to the class 2.1 

guidance mode is based upon the assumption that the orbit desired at thrust 
termination is a conic, and that thrust acts for a relatively short 
duration of time. Examples are: 

a) impulsive velocity-to-be gained (Reference 3) 

b) steering to velocity-to-be gained (Reference 3) 

Class 2,2: Successive Approximation of Closed Form Guidance - The 

Class 2 guidance mode employs algorithms similar to those of Class 2,1, 
but the errors due to these approximations are iteratively reduced in real 
time. 

Class 2,2.1; Successive Approximation of Environmental Accelerations - 
The approximated values of the acceleration integrals are successively 
improved in real time by evaluating the integrals on the current real-time 
predicted trajectory. No examples of this technique are known to the 
author. 

Class 2.2.2: Successive Approximation of Conic Model - The effects 

of neglected terms in the conic model are calculated in real time as time 
varying perturbations acting on the current approximate trajectory, and 
these effects are introduced into the guidance equations. This approach 
is similar to general perturbation techniques well-known in celestial 
mechanics, but no applications to the real-time guidance problem are known 
to the author. 

4. Measures of Performance 

The class of closed form guidance modes has recently received much 
attention, because one obtains a versatile control law which can treat 
large perturbations and yet requires relatively little preflight calculation. 
This approach has limited application, however, because the simplifying 
approximations required for the derivation cannot be justified in general. 

For example, at any iteration of the guidance calculation it may be assumed 
that the gravitational acceleration for the remainder of the flight can 
be approximated by a constant vector. Such an approximation obviously 
works well when applied to short powered flight arcs, but for long arcs 
it can lead to serious degradation of performance. Some one of the Class 
1 modes could be used to eliminate such difficulties, but then extensive 
preflight calculation and storage might be necessary, or, if a linear 
guidance law is employed, the performance could be poor in the presence 
of large perturbations. Thus the choice of a guidance mode is often 
ad hoc, and one must consider many factors. Some measures of performance 
are : 

1, optimality - given that there is a performance index to be 
minimized, say propellant expenditure, how does the obtained 
value of the performance index compare to the theoretical 
minimum? 
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2. accuracy - given that approximations are introduced into the 
derivation and mechanization of the guidance equations, what 
are the resulting errors in the desired terminal conditions? 

These errors can be classified according to: 

2.1 approximation errors - due to analytic approximations 
introduced into the derivation of the guidance equations. 

2.2 computer errors - due to the inaccuracies of the numerical 
algorithms used to implement the guidance equations 
(truncation and roundoff). 

2.3 mechanization errors - due to the inability of the vehicle 
to physically respond to the guidance commands. 

There are also navigation errors, but, insofar as the guidance and 
navigation problems are separable (i.e., assuming superposition 
of effects) , these errors need not be considered in the design 
of a guidance mode. 

3. region of applicability - what is the range of perturbations 
which can be adequately treated by the guidance mode? 

4. computer factors - what are the real time on-board and/or 

earth-based computer requirements, in particular, how much storage 
space is required, what is the length of the computing cycle for 
each iteration of the guidance equations, and how complex must 
the computer be? 

5. preflight preparation - what is the cost in time and money of 
preflight preparation of the guidance equations, in particular, 
how long does it take to prepare the guidance system to 
accomplish a given mission? (the "quick reaction" problem) . 

6. flexibility - what are the types of missions which the 
guidance mode can perform, and how well can it adapt to changes 
in the mission, such as variations of launch azimuth? (another 
aspect of the "quick reaction" problem). 

7. growth potential - what is potential applicability of the 
guidance mode to future missions? 

5. Conclusions 

It is hoped that the classification method and defintiions of 
measures of performance developed here will be of use in the synthesis 
of modes (or mode) for a unified guidance system for advanced space 
missions. Such a system should encorporate the best features of the various 
modes, and improve upon their limitations. Although existing technology has 
been adequate for present day missions, some challenging unsolved problems 
remain in the area of optimal stochastic guidance, especially for the case 
of impulsively applied guidance corrections (see Part 2) . Some interesting 
results have been obtained (References 24 - 28) but more research is needed. 
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Appendix A: Description of Guidance Phases 

Present practice in guidance system design is to segment the overall 
trajectory into a sequence of thrusting and coast periods which can be 
thought of as "phases”, where 

Definition 7 : A guidance phase is a segment of a trajectory, usually 

characterized by the means available for applying the guidance accelera- 
tions, having a distinct guidance objective, i.e., specified end 
conditions. 

The guidance mode in each phase attempts to null errors resulting frc«n 
the previous phase, plus errors due to any current disturbances, by 
attaining the end conditions specified for the phase. Thus a guidance 
system might be called upon to solve in real time many different types 
of two-point boundary value problems for a wide range of initial conditions. 
Possible types of guidance phases for advanced missions are: 

• High Thrust Continuous Guidance 

Launch vehicle guidance 

a. Initial ascent to altitude 

b. Booster stage 

c. Ascent to orbit 

d. Transfer from parking orbit 

Terminal guidance 

a. Retro into lunar or planetary orbit 

b. Injection into earth satellite orbit 

c. Descent from orbit 

d. Soft landing and hovering 

• Low Thrust Continuous Guidance 

Spiral escape from earth 

Earth to target transfer 

a. Lunar 

b. Interplanetary 

Spiral capture by target body 

Continuous orbit adjustment 

a. Earth satellite 

b. Lunar satellite 

c. Planetary satellite 

d. Earth-target transfer orbit 
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• Impulsive Guidance 


a. 

Midcourse 


b. 

Approach 


c. 

Terminal 


d. 

Satellite orbit trim 

e. 

Descent from 

orbit 

f. 

Soft landing 

retro 


• Aerodynamic Guidance 

a. Control of lifting reentry 

b. Control of ballistic reentry 

c. Drag brake control 

d. Parachute control 

Since there are many types of guidance phases, it is usually the 
case that more than one guidance mode will be employed during a mission, 
and more than one command mechanization subsystem will be used. Indeed, 
the guidance techniques for the various phases are so different that 
essentially different guidance systems are used during a mission. 

Appendix B: Guidance Phases for a Typical Lunar Mission 

Ascent Phase 


The ascent phase begins at launch and extends to injection into a 
nearearth circular parking orbit, which might have a standard altitude of 
100 n mi above the earth's surface. A typical ascent phase might last 
8 to 10 minutes. The objective of the guidance system is to attain circu- 
larity (eccentricity equal to zero) at an altitude close to the standard 
value. Guidance corrections are applied by steering the vehicle with the 
gimbaled rocket nozzle and by making small changes in the thrust termination 
time of each rocket stage. The disturbances to the flight path consist of 
imperfectly applied thrust acceleration and external forces, such as wind 
and air density variations. The position and velocity of the vehicle are 
measured by integrating the outputs of accelerometers mounted on an 
inertially fixed platform within the vehicle, or from ground-based tracking 
radars, or from both these sources. 

Parking Orbit Phase 

The parking orbit phase begins at parking orbit injection and 
extends to the restart of the launch vehicle for the injection phase. 

Typical parking orbit durations are 1 to 20 minutes (4 to 80° of coast arc) 
but they can be indefinitely long. There are usually no guidance correc- 
tions required during this phase, but some vernier adjustment of the errors 
remaining from the ascent phase might be made. The disturbances to the 
flight path are negligibly small for short coast arcs, but otherwise 
atmospheric drag becomes important. The position and velocity of the 
vehicle are determined as in the ascent phase, but celestial observations 
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can be incorporated if the parking orbit is sufficiently long. A 
rendezvous and docking of two or more vehicles may occur in this 
phase, the purpose being to assemble a spacecraft capable of completing 
the remainder of the mission. The rendezvous does not alter the 
essential character of the guidance problem and will not be discussed 
here . 


Injection Phase 

The injection phase begins at restart of the launch vehicle and 
extends to injection into earth-moon transfer orbit (which is an 
ellipse relative to the earth with eccentricity of 0.987 for a 66 hour 
transfer). The duration of the injection phase is typically 2 to 3 
minutes. The objective of the guidance system is to attain a transfer 
orbit which will cause the spacecraft to impact the desired target point 
on the moon. The corrections are made as in the ascent phase. The 
disturbances to the flight path are primarily due to imperfectly applied 
thrust acceleration. The position and velocity of the vehicle are 
determined as in the ascent phase. 

Midcourse Phase 


The midcourse phase begins at injection and extends until the 
spacecraft enters the "sphere of influence" of the moon, a point which 
is not precisely defined but is approximately 60,000 km from the moon. 

The duration of a typical midcourse phase is roughly 50 hours. The 
spacecraft is separated from the launch vehicle during this period. The 
primary objective of the guidance system is to correct for errors in the 
injection phase, thus providing a vernier adjustment. There are in 
addition some small disturbances to the flight path to consider, such as 
solar winds, leaking gas jets in the attitude control system, and errors 
in the assumed values of the physical constants which define the mathe- 
matical model used to construct the standard trajectory. The orbit is 
determined from celestial sightings and/or earth-based radar data. The 
guidance corrections are performed by applying short-duration Impulses of 
acceleration (on the order of a minute long) with a small rocket engine so 
as to achieve "delta functions" of velocity. The magnitude of the correction 
is determined by the duration of the thrusting, and the desired direction 
is attained by properly pointing the spacecraft. One or more corrections 
might be made, the first no sooner than 5 hours after injection so as to 
allow time to determine the orbit, and others (usually not more than two) 
as required to null errors in the previous correction. The total impulse 
applied in the midcourse phase depends primarily on the injection error, 
but is typically less than 100 m/sec. 

Approach Phase 

The approach phase begins when the spacecraft enters the sphere 
of influence of the moon and extends until just prior to beginning of 
the terminal phase, a period of typically 15 hours. The objective of 
the guidance system, the disturbances to the flight path, and the 
techniques for determining the orbit and applying the guidance corrections 
are the same as in the midcourse phase. The trajectory is a moon- centered 
hyperbola with a hyperbolic excess velocity of typically 1.0 to 1.2 km/sec. 
Two or more corrections will probably be made, based on orbit determination 
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measurements which sense the position and/or velocity of the spacecraft 
relative to the moon. Examples of such observations would be on-board 
sightings of the angles between the target center and certain stars and/or 
measurement of the change in spacecraft speed as it is acted upon by the 
moon’s gravitational attraction. It is the gathering of this target- 
relative type of orbit information which distinguishes the approach 
phase. Since the ultimate mission accuracy very likely will depend on 
this information, the approach guidance phase is one of the most important 
of all. It supplies the final vernier corrections to the orbit. 

Terminal Phase 


The terminal phase begins at the completion of the last approach 
correction and extends through the final thrusting required to complete 
the mission, which might be a retro-braking into satellite orbit, a 
direct descent to the lunar surface, or a combination of these two 
maneuvers in order to descend to the surface from parking orbit. Integrated 
accelerometer data would be used during the thrusting periods, the initial 
conditions being obtained from the orbit parameters estimated during the 
approach phase. Celestial measurements and/or earth-based tracking data 
would be employed, if possible, during the coast periods. Only small 
impulsive corrections made would be during the parking orbit, if there is 
one. Thus the terminal phase is similar to the ascent- to-injection phases, 
with appropriately modified guidance objectives. 

Appendix C: The Targeting Problem 

Although the objectives of the guidance phases are different, It is 
obviously necessary that they be compatible and lead to the satisfaction 
of ultimate mission objectives. Thus an important aspect of guidance 
analysis is the targeting problem, where: 

Definition 8: Targeting a given guidance phase is the task of 

analytically and/or numerically specifying the objectives of that phase. 

Thus targeting is concerned with the practical task of piecing together 
the solutions of sequence of two-point boundary value problems so as to 
devise an overall solution of the complete problem. The targeting problem 
is almost synonomous with the guidance theory problem to analysts primar- 
ily concerned with guidance maneuvers which take the form of velocity 
impulses, while analysts concerned with continuous thrusting think of 
targeting in terms of specifying end conditions. In the terminology of 
optimization theory, targeting may be thought of as the task of specifying 
the transversal! ty conditions for any guidance phase, given that the 
trajectory has been segmented into phases. 

The conic formulae are used extensively in targeting, for motion 
during a coast period in a drag-free environment can usually be closely 
approximated, with perhaps some empirical correction terms, by the 
solutions of "patched” two-body problems. Thus for guidance purposes a 
closed form solution is valid in these segments of the trajectory, and 
the objectives of a given guidance phase can be stated as attaining a 
certain combination of orbital elements. A dynamic programming argument 
can be used to develop the sequence of desired elements for all phases 
by working backward from mission termination. For example, the objective 
of a retro thrust maneuver to obtain injection into a terminal satellite 
orbit about a planet can be specified in terms of the elements of that 
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orbit. The objectives of the approach phase can be specified in terms 
of the elements of an approach hyperbola which will be optimal for the 
retro phase. The objectives of the midcourse phase can be specified in 
terms of the elements of a heliocentric transfer ellipse which will 
yield an optimal approach hyperbola. The objectives of the near-earth 
injection phase can be specified in terms of the elements of the earth- 
escape hyperbola which will yield an optimal heliocentric transfer 
ellipse. Lastly, the objectives of the initial ascent from the launch 
pad can be specified in terms of the elements of the near-earth parking 
orbit which yields an optimal injection phase. 

The notion of a "patched" conic is not a precise one, for the 
actual trajectory is continuously attracted by many bodies. The 
guidance analyst does not consider the conics to be joined at fixed 
points on the trajectory, however, but instead they are "asympototically 
matched" in order to yield a much better approximation of the true 
motion. That is, the target planet can be considered massless for the 
purpose of injection and midcourse guidance analysis, and the position 
and velocity at closest approach to the target can then be used to 
determine the asymptote of the approach hyperbola (reference 29). The 
magnitude of the position vector at closest approach is the impact 
parameter (b), and the velocity vector is the hyperbolic excess velocity 

< V J ■ The energy and angular momentum of the approach hyperbola are then 

given by c^ * Iv^J and c^ = b | | , respectively. The errors introduced 
by such an approximation are due to the differences in gravitational 
acceleration of the target and spacecraft caused by the attraction of 
non- target masses acting during the approach phase (such as the sun), 
and are usually negligibly small compared to other sources of guidance 
system error. 

It is usually necessary to predict and/or control the time of 
flight to closest approach to the target. Consistent with the notion of 
a massless target, one might take the x^^ coordinate axis in the direction 

of the target-spacecraft relative velocity (v ) as determined on the 
standard trajectory at the standard time of closest approach (t ). The 
predicted first-order change in impact time then becomes 


fit. 




This approach to the problem is well-suited to optimization analysis, for 
the minimum time trajectory is obtained by minimizing x at the fixed time 
t... Another more commonly used technique is to employ the so-called 
linearized-time-of-f light , which is the time of closest approach to a massy 
target corrected for the non-linear effects of the impact parameter, given 
by (references 30 and 31) 

t = t (closest approach) + — — In e 

IvJ 

where e is the eccentricity of the approach hyperbola. It can be demon- 
strated analytically and numerically that t L behaves almost as a linear 
function of the midcourse correction components. 
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Appendix D: A Simple Stochastic Control Problem 

The discussion of guidance modes presented here deals with the 
deterministic guidance problem, assuming that random navigation and 
systematic errors can be treated separately. As indicated in Part 2, 
this is a reasonable assumption for many applications, but does not 
hold for problems where the random errors are large, or where the 
guidance corrections are applied on the form of impulses at times which 
are not a priori specified. In these situations the deterministic 
approach is not appropriate, and a stochastic guidance law should be 
devised . 


The stochastic control problem can be illustrated by a simple 
example. Suppose that at the initial time t Q the final state x^(t^) 
known to be of the form 


x 1 (t f ) - g (u, a) 

where u is some scalar control parameter, and a is some scalar parameter 
characterizing the motion between t and t . For example, a might be 
the initial condition x^t ), ° r magnitude of a perturbing accelera- 
tion acting between t and°t f . Suppose the navigation system has provided 
an estimate of a, denoted by a*, but there is an error in this estimate, 
denoted by E«(a*-a). Suppose this unknown error is a zero mean, Gaussian 
random variable with variance o^ over the ensemble of all similar 
experiments. The problem is to choose the u which in some sense minimizes 
x . If the estimates were perfect (e-o) , one would seek a u° such that 


(u°, ct*) - 0 
3u 

In the stochastic case, however, the error in the estimate can be 
arbitrarily large, so one must deal with the statistical expectation 
(E [•]) of the derivative. This might be expressed in the form of a 
Taylor series as 



+ higher order terms in e 11 



2 

+ [higher order terms in a ] 


where the coefficients of the Taylor series are evaluated as functions 
of u° and a*, and properties of a Gaussian distribution have been used 
in computing the expectation (i.e., the expected value of the odd moments 
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are zero, and the expected value of the even moments are expressible in 
terms of o l ) . Thus it can be seen that the statistics of the errors in 
the estimate become inseparably mixed into the optimization problem, and 
it could be difficult to find u°. The Taylor series method might not be 
the best approach, expecially for states of higher dimension. In any 
case, one Is conceptually faced with solving an infinite number of 
optimization problems corresponding to every possible value of e, and 
choosing a weighted (according to the probability of occurrence of the 
values of e) average of the solutions. 

A still more subtle problem arises if it is desired to choose 
two guidance parameters (u^, u 2 ) to minimize 

X 1 ( V = g l Cu l’ V a) 

subject to 

= 82^ u l» u 2’ = 8* ven value 

Analogous to the deterministic case and the previous example, one is 
tempted to minimize 

E [g 1 (u 1 , u 2 , a) + v g 2 (u 1# u 2 , a)] 

where v is a Lagrange multiplier. Conceptually this corresponds to 
solving an infinite number of optimization problems where the end 
condition is satisfied each time, and the Lagrange multiplier has to 
be treated as a random variable. There is no reason to expect that sue 
solutions exist, however. Alternatively, one seeks to minimize 

E ^ 8 1^ U 1* u 2’ + V E ^ g l^ u l* u 2* 

where v is^a fixed constant. In this case the end conditions are 
satisfied "on the average", that is, the expected value of the end 
conditions satisfy the constraint but individual members of the 
ensemble generally do not. 

It appears that stochastic control considerations analogous to 
those discussed in this simple example will arise in the case of 
impulsively applied guidance corrections. Some qualitative and approxi- 
mate solutions of such problems have been obtained (references 24 - 28), 
but more work in this area is clearly indicated. 
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ABSTRACT 


An approximate analytic solution is developed for minimum fuel guidance 
from an arbitrary point on a hyperbolic orbit into a specified circular orbit. The 
hyperbola must lie close to the plane of the circular orbit and its periapsis radius 
must be close to the radius of the circular orbit. Optimization of the midcourse 
impulse, the finite -thrust terminal burn, and of both maneuvers in combination 
is considered. The particular problem treated is intended as a simple example 
of a new, unified guidance technique. 
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NOMENCLATURE 


c Exhaust velocity 

e Eccentricity of nominal approach hyperbola 

f True anomaly 

F Thrust 

m Mass 

First moment of acceleration during final burn 
Second moment of acceleration during final burn 
R Radius 

t Time 

u Corrective velocity change 

V Relative velocity at nominal target interception 

K 

x Circumferential component of position at nominal target 

y Radial component of position at nominal target 

z Out-of-plane component of position at nominal target 

a Thrust angle with local horizontal 

0 Thrust angle out of orbit plane 

Gravitational constant of planet 
CO Rate of change of a 

0 Rate of change of /S 

Subscripts 

c Critical 

cy Component of critical plane correction in orbit plane 

cz Component of critical plane correction out of orbit plane 

nc Noncritical 

p Periapsis 

0 Beginning of terminal burn 

1 Centroid of terminal burn 

2 End of terminal burn 
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INTRODUCTION 


In recent years, there has been increasing interest in the use of optimi- 
zation theory for the guidance of space vehicles. Much of this work has been 
deterministic and is based upon the idea of flying a minimum fuel trajectory 
from the vehicle’s current state to the desired final state. The early work in 
this field either linearized both the state and the control around the nominal 
trajectory (second variation guidance, Refs. 1, 2) or else numerically fit the 
control to a pencil of optimal trajectories which achieved the desired final 
state (path adaptive guidance, Ref. 3). A more recent trend has been to use 
analytical solutions to optimal trajectories for various approximations to the 
full equations of motion (Refs. 4-6). The latter technique is now widely used 
for booster guidance but space-craft guidance is still largely based upon va- 
rious ad hoc approximations (Refs. 7,8). What is needed is a unified theory 
for the optimal guidance of space vehicles which can handle various missions 
and phases of flight. Such a theory might be developed by the onboard genera- 
tion of nominal trajectories with the use of neighboring extremals for some 
segments of the trajectory and analytical solutions of approximations to the full 
problem for the other segments of the trajectory. While the concept of a neighboring 
extremal may be used (as in second variation guidance), it will not be permissible 
to linearize the control (as is done in second variation guidance) because many of 
the portions of the trajectory have no control. 

The present study represents a first step in the development of a unified 
theory of optimal guidance of space vehicles. The theory is for minimum fuel 
deterministic guidance of high thrust vehicles such as an advanced kick stage. 

It treats both midcourse and terminal guidance in a unified fashion for variable- 
time-of-arrival missions. The primary application would be to orbiter missions 
and to rendezvous missions. 
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The theory is developed by considering a particular example which is 
simple enough to allow explicit development of the equations but still reflects 
many of the difficulties to be encountered in more complicated examples. The 
particular example treated is guidance in a central gravitational field from a 
hyperbolic approach condition into a desired given terminal circular orbit. The 
time for this operation is open and the initial perturbations from the nominal 
approach hyperbola are taken to be small. The nominal approach hyperbola is 
tangent to, and in the plane of the circular orbit. However, the perturbations 
around this nominal trajectory may occur in three dimensions. 

The analysis is divided into three parts. First, a general solution is 
given for an optimal midcourse correction of a trajectory which has a finite 
terminal impulsive maneuver. The special case of the problem considered in 
this paper is treated in this section. The second part of the paper is concerned 
with the motion during a finite thrust terminal burn. On the basis of this analysis, 
due largely to Robbins (Ref. 9), a guidance logic is proposed for guidance during 
the terminal phase. In the third part of the paper, the terminal maneuver and 
midcourse maneuver are considered together so that an overall optimization of 
the combined corrections may be carried out. 
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ANALYSIS 

I - Optimum Midcourse Correction 

Most existing theories of optimal midcourse guidance are for cases where 
all of the corrections are infinitesimal corrections in the neighborhood of an un- 
powered freefall trajectory (e.g. Refs. 10-13). It is necessary to extend this 
theory to the case where the nominal trajectory may contain powered arcs of 
finite magnitude. This section will consider the simplest case of such a problem. 
For this case, the nominal trajectory has a single finite impulse at its end and 
represents an optimum variable-time-of-arrival rendezvous or orbit transfer. 

At some time, it is discovered that the actual trajectory has departed by a small 
amount from the nominal trajectory and a correction must be made in order to 
insure that the original objectives of the mission will be met. In many cases, 
only a single midcourse correction will be necessary. This will be true for 
the particular example treated in this paper and is the only case that will be 
considered herein. The three components of the midcourse velocity correction 
will produce changes in three components of the terminal position and in three 
components of the terminal velocity. A special co-ordinate system has been 
used for the analysis of variable-time-of-arrival position guidance which is 
also useful in the more general problem under consideration. This co-ordinate 
system makes use of a direction known as a non-critical direction (Refs. 10-11). 
A small velocity impulse in a non-critical direction will produce changes in the 
terminal position which are parallel to the relative velocity between the vehicle 
and its target at the nominal arrival time. In the case of an orbiter mission, this 
relative velocity vector may be taken as the direction of the terminal impulse. 

At right angles to the non-critical direction is the critical plane in which the 
position deviations which are orthogonal to the terminal impulse are corrected. 

The total velocity correction in addition to the nominal characteristic velo- 
city of the nominal trajectory is given by Eq. 1. 


33 



OPTIMAL GUIDANCE FROM HYPERBOLIC TO CIRCULAR ORBITS 


E6V 


/ 


2 

u 

c 


F 

+ u 

nc 



3u 

nc 


u 

nc 


3u c 
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This is a first order expression and considers both the velocity change in the 
midcourse impulse and the corresponding changes in the terminal impulse. 

The first term is simply the Euclidean norm of the velocity changes in the 
critical and non-critical directions. The magnitude of the velocity change in 
the critical plane will be determined by the requirement that the position devia- 
tion normal to the direction of the terminal impulse must be reduced to zero. 

The component of velocity in the non-critical direction will be used to reduce 
the relative velocity and the magnitude of the terminal velocity impulse. The 
second term represents the change in the magnitude of the terminal impulse 
due to a small impulse in the non-critical direction, while the third term re- 
presents the reduction in magnitude of the terminal impulse due to a small 
impulse in the critical direction. As we are only considering first order terms, 
only the changes in velocity parallel to the finite terminal impulse need be 
considered. The optimum magnitude of the component of the midcourse impulse 
in the non-critical direction may be found by differentiating Eq. 1 with respect 
to this velocity component and setting the derivative equal to zero. Eq. 1 always 
possesses a single minimum if the nominal trajectory is optimal. The optimum 
magnitude of the component of velocity in the non-critical direction is given by 
Eq. 2, while the corresponding minimum cost due to the trajectory correction 
is given by Eq. 3. 
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Eqs. 1 through 3 represent the general linear solution for the optimum 
magnitude and direction of a single midcourse impulse at a specified location 
when there will be a large terminal impulse and when the transfer time is 
open. For the deterministic case considered herein, such an impulse at the 
earliest possible time will normally provide an optimum correction. For example, 
this is always true for the transfer from a hyperbola to a nearly coplanar circle 
considered herein. For any particular case, the optimality of a single correc- 
tion may be checked by calculating the primer vector (Refs. 14, 15) along the 
unpowered trajectory. If the primer vector exceeds the magnitude of unity 
at any point other than the initial and terminal points, an additional impulse 
will be required for an optimum trajectory. Ref. 15 suggests a method for 
calculating this multiple impulse correction. 

It should be noted that this optimum midcourse correction depends only 
upon the position deviations normal to the direction of the final impulse. It 
does not depend upon any of the velocity deviations at the terminal point on the 
trajectory. The velocity deviations normal to the terminal impulse do not affect 
the fuel consumption to first order and are neglected, while the deviation parallel 
to the final impulse is corrected by the final impulse and does contribute linearly 
to the total characteristic velocity. 

For the particular example treated herein, the nominal trajectory is a 
hyperbola which is coplanar with, and tangent to, the terminal circular orbit. 

The deviations from the nominal position and velocity of the periapsis will be 
analyzed in a Cartesian inertial coordinate system whose x-axis is aligned with 
the direction of the circumferential final impulse. The y-axis will be radial and 
the z-axis will point out of the orbit plane. The rates of change of the signifi- 
cant components of the periapsis position and velocity with respect to the magni- 
tude of a midcourse impulse are given by Eqs. 4 through 6. 
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The noncritical direction is in the plane of the circular orbit and is given by Eq. 7. 
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This is the direction of thrust which will produce no change in the periapsis radius 
to first order. It is now convenient to reference angles in Eq. 6 to the non-critical 
direction to yield Eq. 8. 
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From Eq. 8, the partial of the terminal impulse with respect to a midcourse im- 
pulse in the non-critical direction can be found as Eq. 9. 
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Eq. 9 varies monotonically with distance from the periapsis, between the two limits 
given by Eq. 10. 



nc 


The left-hand limit is the value at an infinite distance while the right-hand limit 
is the value at periapsis. The magnitude of this quantity is always less than unity, 
as it must be if the single correction is to be optimum. By using Eqs. 2, 4, and 
5, the three components of the optimal midcourse correction may be found as 
Eqs. 11, 12, and 13. 



V 2 ( 1 + e cos f )( 2 + e - cos f ) - 2 ( e + 1 cos f 


(13) 

Finally, the total additional cost of the midcourse impulse and the change in the 
magnitude of the terminal impulse due to position deviations in the periapsis is 
given by Eq. 14. 
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It is interesting to compare the fuel consumption due to this optimum 
correction strategy with the fuel consumption of a technique which has been 
used previously, namely the minimization of the magnitude of only the midcourse 
impulse. Figure 1 illustrates the total magnitude of velocity corrections due 
to a midcourse correction impulse at the illustrated locations and in the illus- 
trated directions. The magnitude of each arrow represents the total corrective 
velocity (Eq. 14) necessary to correct a unit displacement in periapsis altitude. 
The arrows above the hyperbola are the corrections necessary to raise the 
periapsis while those below the hyperbola are those necessary to depress the 
periapsis. At each point there are two arrows which are diametrically opposite. 
These arrows are in the critical direction which minimizes the magnitude of 
only the midcourse impulse. The arrows are not of the same length because 
the change in the magnitude of the terminal impulse due to raising or to lowering 
periapsis is of opposite sign. The two shorter arrows at each point represent 
the optimum corrections. The directions of these arrows are symmetric with 
respect to the non-critical direction which is orthogonal to the critical direction. 
The figure shows that the optimum correction strategy always saves fuel rela- 
tive to the other strategy and that the relative fuel saving becomes quite large 
as the vehicle gets closer and closer to periapsis. 
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II - Terminal Maneuver 

The finite thrust terminal maneuver for injection into the circular orbit 
cannot be approximated as an impulse because of its relatively long duration. 
The guidance for this terminal maneuver is developed by utilizing an approxi- 
mate solution of the equations of motion. This approximate solution is a direct 
application of a theory due to Robbins (Ref. 9). It represents a second order 
expansion of the impulsive solution for the optimal trajectory in terms of in- 
verse powers of the average acceleration. Robbins' theory can theoretically 
treat any thrust acceleration schedule because it is based upon the moments 
of the acceleration about a centroid time. For the practically important case 
of constant thrust, the moments about the centroid of the finite thrust burn 
are given by Eqs. 16-19, while the characteristic velocity of the maneuver 
is given by the standard rocket equation, 15. 
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Robbins’ analysis is carried out in an inertial Cartesian coordinate system 
whose origin is at the location of the nominal impulsive burn and whose x-axis 
is aligned with the direction of the impulse. For the present problem, the 
impulse is circumferential and that will be the direction of the x-axis. The y 
axis will be taken to be radial and the z-axis will be taken to be normal to the 
first two axes. The analysis in this rectangular co-ordinate system is then 
developed by ignoring all powers of the inverse of the average acceleration 
higher than the second. Robbins shows that, for the time open case, this 
assumption implies that small angle formulas may be used, that the optimal 
thrust direction is a linear function of time, but that the gravity gradient in 
the nominal direction of the impulse must be considered in the analysis. With 
these assumptions, the equations of motion are given by Eqs. 20-22. 
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z = --(/5+nt) ( 22 ) 

m 1 

The terminal boundary conditions are such that the vehicle will end up 
on a circular orbit at the desired nominal radius. In this Cartesian co-ordinate 
system and with these approximate equations of motion, these boundary conditions 
are given by Eqs. 23-27. 
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The integration of the equations of motion may be carried out by standard 
techniques to yield Eqs. 28-32. 
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There are five quantities that must be determined in order to meet the 
five desired terminal conditions. These quantities are the directions of the thrust 
at the centroid time as well as the rates of rotation during the burn and the total 
duration of the burn. To this order of approximation, these quantities are given 
explicitly by Eqs. 33-37. 
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This completes the development of the equations of motion during the 
terminal burn as well as the solution of the boundary value problem for control 
of the vehicle. It is still necessary to make a feedback control law out of these 
equations and to determine the times at which the thrust should be turned on and 
turned off. An accurate way to create a feedback control law is to continually 
redefine the position of the centroid on the basis of the estimated time-to-go 
to the completion of the burn. This will cause the Cartesian co-ordinate system 
to rotate and change its origin during the burn but will cause the solution to 
become progressively more accurate as the terminal time is approached. With 
this type of feedback control, the engine may be turned off whenever some mea- 
sure of the terminal error becomes small enough. 

The optimum time to turn the engine on may be determined by noting that 
Robbins' theory predicts that the centroid of the final burn should be at the peri- 
apsis of the hyperbola. The required length of the burn may then be determined 
by substituting the control Eqs. 33-36 into Eq. 37 and referring all quantities to 
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the perigee of the unpowered hyperbola, yielding Eq. 38. The last term in 
Eq. 38 is of higher order than the other terms and may generally be neglected. 
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III - Optimization of the Combined Maneuver 

The first section of the analysis considered the optimum midcourse cor- 
rection if there was going to be an impulsive terminal burn. The second section 
then developed the equations of motion and the cost for a finite thrust terminal 
burn. This final section is intended to determine the minimum fuel consumption 
for the combined maneuver with a midcourse impulse and a finite thrust terminal 
burn. With a finite thrust terminal burn, it is possible to produce changes nor- 
mal to the nominal thrust direction during the burn, so that part of the position 
correction may be done during the terminal burn and part may be done during 
the midcourse burn. The total additional cost, over and above the AV of the 
nominal impulsive hyperbola, is given by Eq. 39. 
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The only unknowns in Eq. 39 are y and z which are the position displace- 

P p 

ments at perigee after the midcourse impulse. All the other terms in Eq. 39 
may be considered as constants and they have all been previously determined 
in this paper except for the one given by Eq. 40. 



(40) 


The optimum values of y^ and may then be determined by differentiating Eq. 39 
with respect to these two quantities, and setting these derivatives equal to zero. 
This produces two simultaneous quadratic equations for y^ and z^. These si- 
multaneous equations may be solved by standard algebraic techniques, either 

analytically or by iteration. Once y and z have been determined, the components 

P P 

of the midcourse correction may be determined from the formulas in Section 1 
by substituting the changes in these quantities for the total corrections considered 
in Section 1. The terminal maneuver is then carried out according to the sugges- 
tions developed in Section 2. Eq. 39 may also be used to determine the relative 
cost of having a midcourse correction or of correcting all the position components 
during the terminal burn. These comparative costs may then be used to deter- 
mine the desirability of performing a midcourse correction. 
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CONCLUSIONS 


1. The optimum midcourse guidance correction for rendezvous or orbit trans- 
fer may be found by a simple modification of the standard calculations for 
position guidance. 

2. An accurate near-optimal terminal guidance scheme can be developed ex- 
plicitly using Robbins' theory of near-impulsive transfers. 

3. A unified theory of minimum -fuel guidance can be developed for a large 
class of missions. 

POSSIBLE EXTENSIONS OF THE ANALYSIS 

1. This problem can be extended to nonlinear planar corrections by using 
the results of Horner (Ref. 16). The adjoint solutions corresponding to 
small planar corrections are identical with Horner's. 

2. It should be possible to generalize the analysis to arbitrary time-open 
maneuvers in the neighborhood of a given time -open minimum-impulse 
maneuver. 
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1. INTRODUCTION 

In a previous paper [ 1] we have discussed the existence of controls p that 

minimize a function x° subject to the restrictions that, for every value of its 

£ 

argument t in a metric space, p(t) is contained in some preassigned set R '(t) and 
that x(p)eB^, where x is a given mapping and is a closed subset of a topological 
space H. We have shown that, in a large class of problems, such minimizing controls 
exist in a larger space of "relaxed controls" and that these relaxed controls can be 
approximated by original controls . 

In this paper we shall assume that H is the euclidean n-space E . We wish 

n 

to investigate certain necessary conditions for minimum that might be considered a 
generalization of the Weierstrass E -condition and of the transversality conditions of 
the calculus of variations. In this sense our results represent an extension of 
certain methods and theorems of the mathematical control theory, and specifically 
of references [2] and [3] , to a more general setting. The necessary conditions that 
we obtain are no longer restricted, however, to minima over the space of 
relaxed controls but apply as well to minima over the space^^P of original controls 
(if such minima exist). Thus our present results also generalize Pontryagin' s 
maximum principle. Furthermore, the spacej^* is no longer restricted, as in [1], 
to measurable mappings from one metric compact set to smother. 

Previous attempts to apply the methods of the mathematical control theory to 
problems involving functions defined otherwise than by a system of differential or 
difference equations were mostly limited to special, and linear, problems. Recent 
results of Neustadt [4,5] are, however, quite general. They are based on a 
separation theorem for convex sets that represent certain linearizations of 
constraints. Our approach is, however, different from Neustadt' s; in particular, 
our basic results are stated directly in the form of inequalities involving the value 
of the minimizing control at an arbitrary point rather than in the form of functional 
inequalities . 
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Let T and R be arbitrary sets, B a convex set, Jyl a class of controls, 

that is, mappings from T to R, x = (x\ x 11 ) a given function from^x B to 

E , and B, a given set in E . We wish to characterize a control‘s e yr and a point 
_ n 1 n i ^ 

b € B that yield a minimum of x (p, b) subject to the condition x(p, b) eB^ . The 

necessary conditions for minimum that we derive are expressed in terms of certain 

variational derivatives Dx(p,b; t*,p*) respectively Dx(p,b; t*,r) defined in 

section 2. These derivatives represent, roughly speaking, the rate of change of x 

when its argument p is replaced by the function p* (respectively the constant 

function r) over a "small" set in the "neighborhood" of t*. 

As an illustration, we consider, in sections 3 and 5, the "standard" problem 
of the mathematical control theory of ordinary differential equations and prove a 
slight generalization of the usual necessary conditions. 

2. NECESSARY CONDITIONS FOR MINIMUM 

Let T and R be arbitrary sets, B a convex set, an arbitrary class of 
mappings from T to R, and B^ a set in E^. The vector function x = (x*,...,x n ) 
is a given mapping from^^x B to E^. If p :T -*-R, we denote by p(t) the image, 
under the mapping p, of a point t in T. If the mapping p depends on some parameters 
a,b,c, we designate by p(a, b, c), or by p(* ;a, b, c) the mapping, and by p(t; a, b, c) the 
image of t under the mapping. Similarly x denotes a mapping and x(p) the image of p 
under the mapping x. We also write, when it appears more appropriate, t -*p(t) to 
represent a mapping. 

If p^ and p 2 are two mappings from T to R, and A is a subset of T, we 
designate by [p^, A;p 2 J the mapping p defined by the relations 

p(t) = P x (t) on A, p(t) = P 2 (t) on T-A. 
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Similarly, if P^P^, . . . ,P^, pare mappings from T to R, and A^, . . . , are 

disjoint subsets of T, we designate by [p 1 ,A.(i=l t . . . ,k); p] the mapping p defined 
by the relations 

p(t) = P|(t) on A.(i=l, . . .,k), p(t) = p(t) elsewhere on T. 

Let T* be a subset of T, and let *At be a collection of subsets M(t, a) of 
T(t e T*, ot > 0). Let p p* , b € B, t* e T, a > 0, and let 
p 1 = [p*, M(t*, a); p] . 

If p ' ejp for sufficiently small a, and if 

lim “ (x(p\b) - x(p , b)) 
oi •“ + 0 

exists, we shall say that M x has an -derivative at (p,b) with respect to (t*,p*)" 

and we shall designate this limit by x(p,b;t*,p*). If $ *(t*) is a subset of $ 

for each t* e T* and x(p, b;t*,p*) is the same for all p* such that p*(t*) 

we shall write 

x(p,b;t*,r) or Dx(p,b;t*,r) 

(if#fj£ and the mapping t* *(t*) are fixed). 

Let now p beB, and b e B. We shall write Dx(p,b;b) for 

lim \ (x(p, (l-0)b + 0b) - x(p,b)). 

0 — +0 9 

Definition 2.1. Local variations in jPxB. 

Let pe^.bfB, T* C T, let ^ be, for k=l, 2 n , a collection of 

subsets N^(t, Of) of T(t € T*, a > 0), and ~ lk=l, . . . , n 2 J\ Let^ be 

a mapping from T* to the class of subsets of & . 
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We shall say that (T*, ^ define "local variations for x in ^ x B at 

(p,b)" if the following conditions hold: 


(2.1.1) For t*,t*,t* eT*, ^^,^=1,2 n 2 , and<*,/9>0, 

N fc (t* , a) C N k (t* , 0 ) if a ^ 0; N fc (t* , 0) 

is the empty set; N (t*,a) and N (t*, 0) are disjoint if k ; and N, (t* a) and 
k l k 2 k l 1 

N (t*,£) are disjoint if t* and both a and 0 are sufficiently small. 

2 2 

(2.1.2) Let an array with elements /3 1J (i,j=l, ... ,n) be represented by 0° . 
Let, for every choice of t D with elements t^ e T* and of p D with elements 

p^ the set ft ~ ft(t n ) in E contain all arrays with > 0 which are 

ii ii ^ 

such that the sets N . , .(t , w ) (i, j= 1, ...» n) are disjoint, and let 

7? ii ] i 

p'=p'(t°,p°,u D ) = £p x J. N n j_ n+i (t J . 0 J j )(i, j= 1 n);p] for oP e J2(t a ). Finally let 

b D have elements eB , <P have elements 

0 ij = 0,^=| 9° lfl‘ j S 0, 

* i 

and 

e a .b°= «°b + 

i. J =1 


I 



Then 

(2.1.2. 1) p'e^>; 

(2. 1 . 2. 2) for fixed t D , p D and b 0 , the function 

(w D »<? D ) — 0°, t D , p D ,bP) = x(p' (t a ,p D , aj D ), 0 D .b D ) from fix & to 

is continuous in some neighborhood of (0 D , 0 a ), and has a differential at (0 D , 0°) 
(relative to ft x er>; 
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(2.1. 2.3) for every t* eT*, p* e^p*(t*), andk=l,2, , n 2 , 

D^-^x (p,b;t*, p*) = D x (p,b;t*,r) exists, is independent of k, and has the same 
value for all p* such that p*(t*) = r. 

We can now state our general necessary conditions for minimum which we 
shall prove in section 4. 

Theorem 2.2. Let (p, b) yield the minimum of x 1 (p,b) in^* x B subject to the 

condition x(p,b) e B^. Let j{/) define local variations for x in $ x B at 

(p,b), and let, for all t* e T*, R*(t*) = jp*(t*) I p*€^(t*)} . Let, furthermore, BJ 

be a convex set in E^, b * a point in B*, and 0:B* -»B a continuous mapping such 

that <p( b*) = x(p,b), 0(B*) C B^ and 0 has a differential at b* (relative to B|) 

d0(b*;b* -b*) = 0 *(b*)(b* - b*) (where 0 (b*) is a linear operator from E to E ). 

til 111 b* 1 mn 

Then either 

(2.2.1) ^ ( b*)b*= Min /*(b*)b* , 

1 b l e B 1 1 

or there exists a nonvanishing vector X in E^ such that 

(2.2.2) X - Dx(p,b;t*, r) > 0 for ail t* eT* and r eR*(t*); 

(2.2.3) X ■ Dx(p,b;b) ^ 0 for all b eB; 

and 

(2.2.4) (M°6 - X) T<?> (b*)b*l = Min (mV 

L 1 J b* € B* 

for some p° o, where 6 = (1,0 0) eE . 


- *> • ] 
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Remark . Relation (2.2.2) generalizes the Weierstrass E-condition, relation (2.2.3) 
generalizes the transversality conditions at the initial point and describes the 
dependence on parameters, and relation (2.2.4) generalizes the transversality 
conditions at the endpoint. 

4t 

Theorem 2.2 is of particular interest in the case [ 1] when t R (t) C R is 

a given mapping from T to the class of nonempty subsets of R, and ^^is the class of 

measurable relaxed controls u such that the probability measure o(t) is supported on 
# 

the closure of R (t) for all t e T. We may then assert [1, Th. 2.6] that in a large 
class of problems there exists a relaxed control <j and a point b that yield the restricted 
minimum assumed in Theorem 2.2; and we may verify a priori the other assumptions 
of Theorem 2.2. We are then able to state that a minimizing control cr and point b 
exist and either satisfy condition (2. 2.1) or conditions (2.2.2), (2.2.3), and (2.2.4). 
Since these relations often admit only a finite number of solutions, we can determine 
a minimizing (J and b; in this sense, [l,Th. 2.6] and Theorem 2.2 often provide 
constructive conditions for minimum. 

3. FUNCTIONS OF CONTROLS DEFINED BY ORDINARY 
DIFFERENTIA L EQUATIONS 

We shall now illustrate the use of Theorem 2.2 in certain standard problems 

of the control theory, postponing the proof of the results presented in this section to 

section 5. Let T be the closed interval [ t , t ] of the real axis, R a separable 
# o 1 

metric space, R a mapping from T to the class of nonempty subsets of 

R> B C E , B C E , and g:E xTxR-*E . In this section, and in section 5, the 
o n 1 n n n 

words measure and measurable will be used in the sense of Lebesgue and |A I will 
represent the measure of A C T. 

Let^ r be a class of mappings p:T — R such that t — g(v,t,p(t)) is measurable 
on T for every v e and pe^ and [ p., A.(i=l, . . . ,k);p] if k is a positive integer, 
each A. is a denumerable union of intervals, and pe&' , p.e^' (i=l, . . . , k). We shall 
henceforth refer to elements of as "measurable" mappings (as distinguished from 
measurable mappings). We set 01 = {/>£$' I P(t)e R # (t) on T j . 
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For p and e B^, we consider an absolutely continuous function 
y = y( • ; p, b^) on T such that 

{3.1. 1) dy(t)/dt = y(t) = g(y(t), t.,p(t» a. e. in T 

and 


(3.1.2) y(t o )=b Q . 


We wish to investigate certain properties of a point b^ € B q and a mapping 
pe^ that minimize y 1 (t^;p, b^) on $ x subject to the restriction that 

y (t i ;p, V e Bl - 

We shall say that a sequence |M ^ of closed subsets of T is "regular at t" 

if |M. I — * - 0 as j — * oo , t cM,, and diameter (M.)< c|M I for some positive c and 
3 3 3 J 

all j=l,2 We shall say that a "measurable” mapping p*:T — >R is "admissible 

at t" if p*(t) € R^(t) on T and lim . f g(v,t,p*(t)) dt = g(v,t,p*(t)) for all 
i ■ ■ oo I . ! J IV1. 

J j j 

ve and all sequences {M^ that are regular at t . 

We set 


R*(t) = | p* I p* is admissible at t { 


and 

R*(t) = |p*(T) i p* zjl *<T)( . 

We shall also write R*(t), M. , etc. to represent closures of the sets R*(t), M., 

etc. 

Assumption 3.2 . For every v e E^, reR, and teT, the function g(v,t, ■ ) is continuous 

on R, g(v, * ,r) is measurable on T, and g(* , t, r) is continuous and has continuous 

first order derivatives on E . Furthermore, for every v in E there exists an 
n J n 

integrable function on T such that lg(v, t, r) I ^ j^(t) on T x R. Finally, for every 
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bounded subset D of E there exists an integrable function ip. 


|g v <v, t, r) I i V' D ( l ) on I) x T x R. Here Ig 1 = 
n 

(agVav 3 ), and lg y ! = . £ ^ U)g 1 /0v i I. 


r 


on T such tha* 

g is the matrix 
b v 


Remark. Assumption 3,2 implies that we may choose as the class of all the 
measurable functions from T to R. 


Theorem 3.3. Let (p,b ) minimize y*(t. ;p,b ) among all points b in B and all 
o l^o o o 

"measurable" mappings p such that p(t) eR (t) on T and y(t^;p, b^) eB^, and let 

Assumption 3.2 be satisfied. Let y = y(. jp.b^), b^ = y(t^) and let, for k~ 0, 1, B* 

be a convex set in E and (f>. = (<h }, .... dy?) : E — - E a continuously differentiable 

m, ^k ^ k y k' m, n 
k _ _ k 

mapping such that <£ (B*) C B and <p> (b*) = b for some b* fB*. Let A, be, for 

K . K . K K K K K K K. 

k= 0, 1, the matrix (Otf>jVt>b* J ) evaluated at b*, and let A^ be the i-th row of this 
matrix. Then either 


(3.3.1) A } * b* : 


Min A 

b r B t 


b * 


or there exists an absolutely continuous function z:T — E such that 

n 

(3.3.2) y(t) = g(y(t), t,p(t)) a. e. in T, 

(3.3.3) z(t) = - g^(y(t),t,p(t))z(t) a.e. in T 
T 

(where g^ is the transpose of the matrix g^), 


(3.3.4) 

1 z(t) I j- 0 on T, 


(3.3.5) 

A t) • g(y(t),t,p(t)) = 

Min z(t) ■ g(y(t), t, r) a.e. in T, 



reR # (t) 

(3.3.6) 

7 . (t ) • A b* = Min 

° 00 U* , 

z(t ) * A b* , 
* o' o o 
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and 


(3.3.7) Cyfij-z^)) • A x b* ^ Min (yfi^ C^)) ■ A^* 

b* e B* 


for some y > 0. Hence 6^ 5 s (1, 0, . . . , 0) € E^. 

In particular, if R (t) = R on T and ft ' contains the constant mapping t — r 
for all r in a dense denumerable subset R of R, then R*(t) can be replaced by R in 

OO 

statement (3.3.5). 

By combining [1, Th. 3.1} and Theorem 3.3, we can prove the existence of a 

minimizing relaxed control a and a point b and can state some of their characteristic 

o 

properties. We require 


Assumption 3. 4. 


(3.4, 1) R is compact. 

(3.4. 2) is compact and B^ is closed. 

(3.4. 3) There exists an integrable function if> on T such that 
fg v (v, t, r) | 0(t) on x T x R. 

(3.4.4) Let ={ p:T— R I p(t) e R^(t) on T and p is measurable } . Then for 

every e there exists a closed subset T of T, of measure at least |T | - e, with the 

— € — #— # 

property that (a) for every t e T^ and every reR (t) there exists a mapping pe R , 

continuous at t when restricted to T^, and such that the distance from p(t) to r is at 

most e; and (b) for every t e T and every h > 0 there exists a positive 6 = 6 (h, t) such 

# # — ^ — 
that R (t) and P (t) are in the h-neighborhood of each other if teT^ and It - t | ^ 6 . 

Now let S be the class of regular Borel probability measures on R. It is well 
known [9, p.426] that a metric can be defined on S such that S is separable and the 
convergence in S is the weak convergence of measures: that is, a sequence s^ s s^, . . . 

converges to s in S if J c(r)s.(dr) — * J c(r)s(dr) as j oofor every continuous 

J R j J R 
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# — £ 
c:R -*E . Let xi- be the set of mappings a from T to S such that o(R (t) ; t) = 1 on T 

and t — * c(r)cr(dr;t) is measurable on T for every continuous c:R — E^. Here 

o(R' ; t) is the o(t)- measure of a subset R' C R„ 

We refer to an absolutely continuous function £:T — - as a "relaxed curve" 

if £(t Q )eB o and £(t) belongs, a.e. in T, to the convex closure of the set 
) g(£(t), t f r) ! r e R^(t)}. This definition is equivalent, in view of our assumptions and of 
[1, Th. 3. 1] , to the statement that £ = £(• ;u»b Q ) satisfies the relations 

£(t) = \ g(£(t), t, r)u(dr;t) a.e. in T, 

J R 

« (t o> - V 


for some and b eB . (This definition is also consistent with the one in 

o o 

[ 1, section 3] for A- E .) 

We can state 


Theorem 3.5. Let B^, B^, T, R, R , and g satisfy Assumptions 3. 2 and 3.4, and 
assume that £(t ;a' ,b')f B, for some v 1 € A/ and b’ e B . Then there exist a relaxed 

1 - 1 i ° ° t 

control ct and a point b cB that minimize £ (t ;<r, b ) on ^ x B subject to the 
o o 1 o' o 

condition that £(t^;cr, b^) eB^; and the corresponding minimizing relaxed curve 

£ = £ ( • ;(J, b^) can be uniformly approximated on T by a sequence £ , £^, • • • of 

absolutely continuous curves such that £.(t) = g(£.(t), t,p.(t)) a. e. in T (j= 1, 2,. . . ), 

1 ^ J 1 

the mappings p^ are measurable, and p.(t)cR (t) on T. 


Let f(v, t, s) =/ R g( V , t, r)s(dr) on E n x T x S, let S*(t) -jseS |s(R*(t)) -- l} for 

teT, let b, - £(t :a, b ), and let E , B*, <f>, , and A, be defined as in the statement 
11 o' m, k k 

of Theorem 3. 3. Then either condition (3, 3.1) or conditions (3. 3. 2) through (3. 3. 7) 

— — * 

of Theorem 3. 3 are satisfied, with y,g,p, R , and r replaced by, respectively, 


£ , f, u, S*, and s. 
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Furthermore, condition (3.3.5) of Theorem 3.3 implies that, a.e. in T, 

(3.5.1) z(t) • g(?(t),t,r) - Min z(t) • g(|(t),t,r) 

reR 

- - # 
for every r in the support of cr(t), if R (t) = R on T. 

4. PROOF OF THEOREM 2.2 

The proof of Theorem 2. 2 is essentially contained in the lemma that follows 
and that resembles, in many respects, Lemma 3. 1 of [ 3, p. 132] . The convex set 
W is patterned after a construction of McShane [6, pp. 17-18] . Brouwer' s fixed point 
theorem appears to have been first applied in a similar context by H. Halkin (7, p. 75] . 

Lemma 4.1. Let (p,b) minimize x*(p, b) in ^ x B subject to the conditions 
x^(p,b) = 0(£ = 2, . . . , n). Let define local variations for x inj£x B at 

(p,b). Then there exists a nonvanishing vector A in such that A* > 0, 

(4.1.1) A • Dx(p, b;t*, r) > 0 for all t*e T* and re R*(t*), 

and 

(4.1.2) A • Dx(p , b;b) > 0 for all b e B . 

Proof . We shall use the notation that we have introduced in section 2. Let 

V = {Dx(p,b;t*, r) |t* eT*, reR*(t*)[, = jDx(p,b;b) Ibe b} , and let W be the convex 

cone in E generated by V. U V_.; that is 
n 12 

W : =|a 1 v. + . . . +a n v Iv.eV , U V_, a 1 5* 0(i= 1 n) j . 

*1 n i 1 2 ’ 

Assume now, by way of contradiction, that there exists no vector A with the stated 
properties. Then we can easily deduce from elementary properties of convex sets 
that there exists a point w = (w\ 0, . . . , 0) in the interior of W, linearly independent 
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vectors (points) w. € W, and positive numbers c*(i= 1, . . . , n) such that 

n 

(4.1.3) W 1 < 0 and w = ^ cV. 

i= 1 

By the definition of W, there exist points t 1 e T*, controls p l c (t* ) , 

points b 1 * 1 e B and numbers a 1J 5 = 0 (i, j= 1 n) such that 

n 

(4.1.4) w. = ^ * Dx(p, b;t^,p^ f b^) (i= 1 n) 

}=}_ .. 

where, for each i,j, Dx(p, b;t^ f p 1 ^, b^) either represents Dx(p, bjt 1 , p 1 ^) (and is 

independent of b 1 ^), or represents Dx(p,b;b^) (and is independent of t 1 ^ and p 1 *). The 
£ 

matrix (w.) (i,£= 1, . . . ,n) is nonsingular since the vectors w. are linearly independent. 

Vl i 2 j 2 

Let now of be sufficiently small so that the sets (t , o') and (t ,/3) 

^1^1 *2^2 — — * — ^ 
are disjoint if (k^t ) f (k , t ), a ^ a and p * a , let a < 1, and let 


A = 

<5eE \0^6 1 ^a/l > a k M(i=l,. 

n 1 / 

. .,n) 


\k, j-1 / 



For every 6 eA , let ^(<5 ) = a* J S 1 and 0 1J (<5 ) = 0 respectively a/ J (6 ) = 0 and 

0^(<5) = a ^6 if Dx(p, b;t* , b ^) represents Dx(p, b;t , p 1 ^) respectively 

Dx(p,b;b^). We observe that the sets N (t , m^( 6)) are disjoint and 

nj + i-n 
n 

y 0 lj (6) - 1 for 6 cA . 

i,3=l 

We now consider, for each A , the "perturbed" mapping p* (6 ) = p ? (t D , 
P D > cj d (6 )) = [p lJ . N n j_ n+ ^ tl )) (i, j= 1, . . . , n); pj in $ and the "perturbed" 

point b’ (<5 ) = 0 a (S) • b a in B. Bv condition (2. 1.2,2), the function 
6 — * £(6 ) = £( cu a (6 ) , 0° (6 )) = x(p* (6 ), b T (6 )) from A to is continuous in some 
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neighborhood A (relative to A) of the origin 0 of and has a differential at 0 
(relative to A). Furthermore, by (4. 1.4), the right-hand derivative 


3 ?(o )/35 k ! = V id£ (,.P (& L n D f6 W/B,.}* 


I = / (3«(« a (5).0 a (5))/8w J 3a; (6)/36 i 9(^(5), 

5 = 0 

i,J = l 


Q°(6 ))/a0 1J *30 13 (6 )/96 k ) 


6=0 


Dx(p,S ; t kj ,p kj ,b kj )a kj = w k (k=l, 


j=l 


Thus the jacobian matrix £^(0) = (a£* (0)/36 3 )! 1 ._ ^ is nonsingular. 

Let a(S ) = (|.(0)r 1 (i(6)-4 (0) -7 (0) • 6), and let c = (c 1 c“). 

O 0 

We shall now show that the equation 
(4.1.5) 5 = yc - a(6) 

has a solution 6 (y) for all sufficiently small positive y. Indeed, since £(•) has a 
differential at 0 (relative to A), there exists a positive p ^ such that 

c • 

■ 1 . . | 1 min r 

la (6) I ^ - 5 

4 c max 


and 


where 


5 eA' if 0^6 « p (i= 1, . ..,n), 


c . = Min c , c = Max c , and <5 = Max 5 . 

min . max . max 


Let 


and 


0<p^p,y = ~p/c 
^ 2 max 


A” ={6eAM I6 1 - yc 1 1 ^ \ 7 c min (i= 1, 2, . . . , n) } . 


Then we can easily verify that A M is homeomorphic to a closed ball in E^, and 


• * • * n ). 
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y c - a(6 ) is a continuous mapping of A" into itself. Thus, by Brouwer’ s fixed point 
theorem, there exists <5 = 5 (y) satisfying equation (4. 1. 5). It follows then from 
relations (4. 1.3) and (4. 1.5) that 

n 

S(6(y)) - 1(0) = y£^(0) c = y ^ C l w. =yw= (yw\ 0, . . . , 0); 
i= 1 

hence 

H l (6 (y)) = iV) + yw 1 < ?\o) = xV.b) 

and 

?<S(y» = 0 (f “ 2 n). 

Since p’ (5(y)) e & and b' (6 (y )) e B for all p /3 q and 6 eA^ C A and since 
6 (y ) t A" and £(d(y))= x (p f (<5(y )), b’ (6<y))), we conclude that, contrary to 

y 1 

assumption, (p, b) does not minimize x (p,b) subject to the restriction that 
x (p,b) = 0 (£ 2, .... n). This completes the proof of the lemma. 

4.2 Proof of Theorem 2.2. Let c = (b, b*) for be B and b* c B* , let c = (b,b*), and 
let C = B x B*. Then (p,c) minimizes x\p,b) on a x C subject to the restrictions 
x*(p,b) - </(b*) = 0 ({= 1,2, .. ,,n). 

Let the function y = (y , y\ . . . , y°) on Jg x C be defined by 
y°(p*c) = y°(p» b, b*) = x 1 (p,b), 

y £ (p,c)= y* (p, b, b*) = x* (p, b) - </(b*) (£=l,...,n). 

Then we verify that (T*, $ *,f) define local variations for y in^ x C at (p, c). It 

0 1 n 

follows then, by Lemma 4. 1, that there exists a nonvanishing vector p= (p ,p , . . * ) 
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in E and a vector X = (p° + ju 2 , . . . ,p n ) in E such thatu 0 3s 0 and 
n+ 1 n 

(4. 2.1) p • Dy(p, c;t*, r) = X * Dx(p, b;t*, r) ^ 0 for all t* e T* and re R*(t* 

and 

(4.2.2) ix * Dy(p,c;c) = fX°Dx l {p, b;b) + ( X - M° 6 1 ) * (Dx(p,b;b) - 

4>b*(b*)( b * - b*)) ^ 0 for all beB 

and b* e B*, where <5^ = (1, 0, ... ,0) e E^. We observe that Dx(p,b;b) = 0; hence, 
setting b = b in (4, 2.2), it follows that 

(X - p 0 ^) • </> b ^(b*)(b* - b*) « 0 for all b* eB*. 

Since p is nonvanishing, either X is nonvanishing or p = (p°, -p°, 0, . . . , 0), p° > 0, 
and 


<f, b* <b i )b i = Min 0 b* (b i )b i • 


5. FUNCTIONS OF CONTROLS DEFINED BY ORDINARY 
DIFFERENTIAL EQUATIONS 


Proofs . We shall use the notation of section 3 and we shall make, at first, the same 
assumptions as in Theorem 3.3. 

Let, for any integrable function f from T to some euclidean space, T’ (f) be 
the set of all the points t* in T such that |f(t*) I is finite and 


lim 


1*1 


f(t)dt = f(t*) 


M. 

1 


for all sequences { of closed subsets of T that are regular at t*. It i 

known [8, Th. (6.3), p. 118] that the set T* (f) has measure |T I. 


is well 
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Now let D° be a bounded convex open set containing the range of y, let 

it = i 0 be defined as in Assumption 3.2, let D be a dense denumerable subset of 
D 00 — 

D° and R a dense denumerable subset of U R*(t), and let g(v, * ,p( )) be the 
00 _ ter 1 

function t — g(v, t, p(t)) . Then 


T* = (T> (g(v,*, r))fT T' (g(v,*,p(-))))nT T <* o )n t 1 ) 

veD , reR 

oO OO 

has measure IT I. We also verify that T' (g(v, •, r)) j T* and T <g(v,- ,P(-)))0 T* for 
all veD° and re R . Indeed, let t*tT», veD°, reR , and let v , v , . ... be a sequence 

00 00 i 

in D converging to v. Then, by Assumption 3.2, 

oO 


lim sup 
3 <» 


1 

IM. ! 

J 



g(v,t, r)dt - g(v.,t*,r) 


lim sup j C !g(v, t, r) - g(v., t, r) I dt < 
j — « J 

lim sup I v - v. I i^ 1 | l ^ Q (t)dt = 0 o (t *) | v-v. | (i= 1 , 2, . . . ), 

j — » J 


for every sequence { M.j that is regular at t*. Since g( . , t*, r) is continuous on D , 
we conclude that T' (g(v, • , r)) D T*. We similarly show that T’ (g(v, • ,p(-))) ^ T * 
for all veD°. 


We next define sets N R (t, a) and the corresponding collection 


J: 


2 

Let m = n , 


and let 


-mi+m-k+ 1 


k-1, i 


- u 

i= 1 


( ”k,i’\-i,i ] ' 


and 

N k (t) = (t+ O T(teT;,k=l m; i=l,2,...). 
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We then define jyt, a) for a 5 0 as N k (t) n (t, t + /^(t, a)] , where /? k (t, a) is 
nonnegative and such that liyt, a)\ = Min(a, |N k (t)|). We observe that diameter 
( N k (t* »)) « 2 m |N k (t, Of) I for all t and a, and [jyt, a) f = a for sufficiently small t 
and t < 

We shall henceforth use the above definitions of T* and as well as the 
definitions of<$* and R* of section 3. We shall also use the notation of Definition 
2.1. Let t 1J eT* and p 1J e£?*(t^) (i,j= 1, . . . ,n), and let 


p f = p’< t ,p ? ,oT )= [p 


’ Vn + i (t > M" 1 - 


» k :;p] 


for all fl(t n ). Finally, let B = B*, b = b*, and for pe^f and beB, let the 
absolutely continuous function y = y( • ;p,b): T be the solution of the system 


y(t) = g(y(t),t,p(t)) a. e. in T, 


y(t Q ) = <P 0 ( b). 


It follows from Assumption 3.2 and from well known theorems that there 
exists a neighborhood B of b in B such that the function y, as just defined, exists, 
is unique, has its range contained in D°, and depends continuously on b, uniformly 
in p, for all b e B and all "measurable" p such that the set {t eT 1 p(t) ^ p (t) } 
has a sufficiently small measure. 

Lemma 5.1. Let p' = p' (t n ,p D . u D ). For all teT, beB, t lj eT*, and 
P 13 e<*f*( t‘ J ) ,n), the function b) — y(t;p' , b) is continuous in some 

neighborhood r of (O a ,b) in S2(t a ) x B; and, for all i, j= 1 n, the limit defining 

the right hand derivative of y(t;p' , b) with respect to qj at 0 exists and is uniform 
in r, and this derivative is a continuous function of (^b) in r. Similarly, 

lim £ <y(t;p' , b +9(b -b » - y(t;p\b )) 

0 — + 0 y l £ 1 1 
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defining the right hand derivative 


ay(t;p» , b + 6 ( b - b ))/ae| 

1 le=o 

exists, this limit is uniform in T x r, and it is continuous in r x T. 

Finally, letx(p,b) = ytt^p.b). Then 

D x(p, b;t*, p*) = Dx(p, b;t*, p*(t*)) = Z(t*) (g(y(t*), t*, p*(t*)) - g(y(t*), 

Sk 

t*,p(t*))) (k= 1,2,..., n 2 , t* eT*,p* e^*<t*)), 

and 

d <t> Q ( b) 

Dx(p, b;b) = Z(t Q ) -g— (b-b) (b eB), 

where the matrix function Z is the solution of the system 

Z(t) = -Z(t)g v (y(t),t,p(t)) a. e. in T, Z^) = I (the unit matrix). 

Proof of Lemma 5. 1. Let t° and p D be fixed. For fixed i and j in {l, 2, . . . , n} , 
let t*=t*\ p*=p lj , and M(a) = Ngr n+ T (t*> a) for a 5 0. 

We observe that, for every sequence oyc^, . . . converging to +0, the 

sequence (mJ = is regular at t*, = 0 for all a, and lM(a) !=« 

for sufficiently small ol. It follows that, for all veD°, 

lim - f g(v, t,p(t))dt = g(v, t*,p(t*)) 
a~ + 0 a J M(a) 

if p = p, p = p*, or p(t) = r e R on T. 
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We next consider y(t ;p r ,b) as a function of ( y ° ,b). The measure of the set 
{teT |p’ (t a ,p D , ^ p T (t a ,p D , a; a ) } converges to 0 uniformly in S2(t a ) when 
n 

□ | = Y i — iJ , 0 

,LW u/< ^ » cu ' a; 1 

i.J=l 

Furthermore, [g(v, . , r) | and lg v (v, • , r) i are bounded by some integrable function 
ip 1 on D° x R. We conclude, using standard arguments, that y(t;p' ,b) is a uniformly 
continuous function of ( W D , b, t) and y(t;p' , b) e D° in r x T, where r is some 
neighborhood of (0 a ,b) in S2(t°) x B. 

Now we fix b and sufficiently small w 1J ((i,j) ¥ (i, j )) as well as i,j,t D , and p n , 
and set p(a) = P f (t a ,p a , ) and y(t;a) = y(t;p(o), b) for = ot > 0 and t eT. Then, 
for sufficiently small oc, p(t;a) = p(t;0) for t eT - M(a), p(t;o:) = p*(t) and p(t;0) = p(t) 
for teM(o'), y(t;o) = y(t;0) for t s? t*, and, for t>t*, 


XMa) = — (y<t; or) - y(t;0))= 


“ f (g<y(0;a), o ,p( 0 ;i 


hence 

(5.1.1) A(t;a) 


I, 


;<*)) - g(y(0;O), e,p (0;O))) d0; 


g (y(0;O),0,p(0;O))A(0;o;)d0 


[t*, t] -M(a) 

<g(y<0;o;), 0 , p*(< 9» - g{y(0;O), 0,p(0)))d0 + 


if. 

I 


M(a) 


(g v (y(0;«),0,P(0;O)) - g (y(0;O),0,p(0;O)))A(0;a)d0, 


[t*,t] -M (a) 
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where y(0;o) is, for each 9 and a, intermediate between y(0;a) and y(0:O). Since 
y(. ;o) converges uniformly to y( *;0) as a — 0, lg v (v, t, r) I ^ ^ Q (t) on 
D° x T x R, p* e R*(t*), and T*C T' (ip Q ), we can assert that 

lirn - f (g(y(0;a),e,p*(0)) - g(y(0;O),d,p(O)))dfl 
a— + 0 01 J M(a) 

= lim - f (g(y(t*;O),0,p*(0)) - g(y(t*;0), 0,p(O)))dO 
o^ + 0 a J M(o) 

= g(y(t*, 0), t*,p*(t*)) - g(y(t*;0), t*. p(t*)), 

and that this limit is uniform in F x T. Furthermore, 

g v (y(O;a),e,p( 0 ;O)) - g v (y(0;O). 0, p( 0 ;O)) I 

converges to 0 with a, for each fixed 0 in T, uniformly in r, because g (• , t, r) is 

O r 

continuous, hence uniformly continuous, in some compact set containing D , tor 
every t and r. Moreover, the uniform convergence, hence also the boundedness, of 
the second term on the right of (5.1. 1) implies that A(.; .) is bounded. Since 
Ig (v, t, r) | t . ip (t) on D° x T x R, it follows then that the last term on the right of 

(5.1.1) converges to 0 with a, uniformly in T x T. 

Let rj (t) = lim A(t, q) for tcT. We can now conclude that 77 exists, is 

O' — *> + 0 

unique, that this limit is uniform in T x T, and that 

(5.1.2) 7](t)= f g (y(0;O), 0 ,p(0;O))tj (0)d0 

J t* 

+ g(y(t*. 0),t*,p*(t*)) - g(y(t* f o),t*,p(t*)) for t > t*. 
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Now we must investigate the dependence of 77 on (^c^b). Let and 

(av b 2 ) be botb in r and be such that w l | = ^ = 0, and let y^O.Pjl’M 1 (*) and 
y (.), p 2 (*)> 7 ? 2^*) represent the corresponding determinations of y( . ; 0), p( . ;0), and 
!)(•)• Let also M ={tfT ! Pl (t)/p 2 (t)} and At) = ln.(t) - j, 2 (t) I. Then (5. 1. 2) yields 

A(t) s f if A9)( In (9)1 + Ip (fl)l)d9+ f 4<0)A(0)d0 
J M J t* 

+ j* lg v (y 1 (0).e.P 1 (0))-g v <y 2 <0).fl.P 1 (0))Hri 2 (e)ld0 

+ - g(y 2 <t*),t*, P *(t*))l 

+ Igfr^t*). t*,p(t*)) - g(y 2 (t»), t*. p(t*» l for t > t*. 

We can directly verify from (5.1.2) that 77 is uniformly bounded on T x T. We can 
show, therefore, as in a previous argument, that the third integral in the last 
relation converges uniformly (on T) to 0 with l,^-^ I + [b^-b 2 ^ ‘ i nte gr a ^ 

converges uniformly to 0 because IM I — 0 uniformly, and the non-integrated terms 
converge to 0 uniformly with ly^(t*)-y 2 (t*) I . It follows that £( • ) 0 uniformly on T 

as + lb 1 -b 2 l-0. 

We can solve equation (5.1.2), specifically when cu Q - b = b, and 
r = p*(t*), and find that, for k = nj - n + i, 

D^x(p,b;t*,p*) = 77^) = Z(t*)(g(y(t*),t*,r) - g(y(t*), t*, p(t*))). 

Thus D^x is the same for all k and all p*e^£ *(t*) such that p*(t*) = r. 

Similar arguments prove our assertions concerning y(t;p’ ,b + 0(b 2 - b^)) as 
a function of 0, and yield the representation of Dx(p,b;b). 

This completes the proof of the lemma. 


73 



RESTRICTED MINIMA OF FUNCTIONS OF CONTROLS 


5.2 Completion of the proof of Theorem 3.3. We shall now show that 

define local variations for x in ^ x B at (p, b). It is clear that, by construction, 
the collection satisfies condition (2.1. 1). Since the sets N^(t,a) are unions of 
intervals and p 1 e , n), the mapping p' belongs to $ . 

It follows from Lemma 5. 1 that the function ( w d , 0 D ) — £(cj d , 0O) satisfies 
condition (2. 1.2.2). Indeed, we have shown there that the right hand partial 
derivatives of £ with respect to each ^ at w l ^= 0 and with respect to each 0 1 * at 
6^-0 exist, are continuous, and the limits defining them are uniform for and 
0 D sufficiently close to 0° . Finally, statement (2. 1.2. 3) follows directly from 
Lemma 5.1. 

Thus *,Jf) satisfy the conditions of Definition 2. 1 and Dx(p,b;t*, r) and 

Dx(p,b;b) have the representations described in Lemma 5.1. All the statements of 

Theorem 3.3 . except statement (3.3.5), now follow directly from Theorem 2.2 
T 

after we set z(t) = Z (t)X on T. Furthermore, statement (2.2. 2) implies (3.3.5), 
with R*(t) replaced by R*<t). Since, however, g(v, t, • ) is continuous on R for all v 
and t, we conclude that statement (3.3.5) is satisfied. 

Finally, consider the special case when R^(t) = R on T and contains all 
the constant mappings into R . In that case, for each re R and t* e T*, the set 

oo oo 

Jl *(t*) contains the constant mapping from T to r, and R*(t*) = R = R. 

00 

This completes the proof of Theorem 3.3. 

5.3 Proof of Theorem 3. 5. The first part of the theorem, concerning the existence of 

b and a as well as of the approximating sequences, follows directly from f 1, Th. 3.1] . 

Next we observe that, for S # (t) = (scS !s(R # (t))- l} on T, S,S # , f, and ^ satisfy the 

assumptions made in Theorem 3. 3 about R, R^, g and ^ , respectively. Furthermore, 

since f(v, t, s) = g(v, t, r) on E x T for every measure s = s^ concentrated at the single 
n r 
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~ # 

point r, it follows that the set of a in ^ that are admissible (with respect to f and 
S # ) at t* contains a *(t*). Finally, there exists a dense denumerable subset of S 

containing {s [r e R }. We may now apply Theorem 3.3, with S, S^, f, and ^ 
r # °° 

replacing R, R , g, and &L , respectively, and derive directly the second part 
of Theorem 3. 5. 
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THE REDUCTION OF CERTAIN CONTROL PROBLEMS 


TO AN "ORDINARY DIFFERENTIAL" TYPE 
by J. Warga* 

The most commonly encountered problems of the mathematical 
control theory are of the "ordinary differential" type, that 
is, are defined by systems of ordinary differential equations 
involving control functions as well as by certain additional 
relations that must be satisfied by the control functions and 
the state variables. An interest has also been evidenced in 
certain more general problems in which, for example, ordinary 
differential equations are replaced by difference-differential, 
or, more generally , delay-differential equations, or in which 
absolutely continuous solutions of differential equations are 
replaced by piecewise absolutely continuous solutions with at 
most k jump discontinuities. The purpose of the present note 
is to show that many such non-standard problems can be easily 
transformed into an equivalent "ordinary differential" form to 
which all the, by now, classical results of the control theory 
are directly applicable. 

Let U be an arbitrary set, E n the euclidean n-space, 

ACE , B C E x e , T the closed interval [t , t, ] of the real 
^ n n n ' o i 

axis, V a mapping from T into the class of nonempty subsets of U, 

and f: E xtxu -* E . An "ordinary differential" control problem 

n n 

consists in determining a function u: T U and an absolutely 
continuous function x=(x^, x^ , . . , x n ) : T -+■ E r such that x^(t^) 

is minimum and 

* This work was supported by N.A.S.A. grant NGR 22-011-020, 
Supplement 1. 
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(0.1) 

d -|^- = x(t) = f(x(t) 


(0.2) 

(x (t Q ) , X (t 1 ) ) e B , 


(0.3) 

X (t) e A (t £ T) , 

and 

(0.4) 

u ( t ) e V(t) (t e T) . 


We shall now transform into an "ordinary differential" 

form a few of the more frequently encountered non-standard 
certain 

problems, namely (a)A delay-differential control problems, 

(b) staging problems, and (c) problems involving variable times 
as well as constraints relating the values of x on a finite 
subset of T. For the sake of clarity, we consider only relatively 
simple versions of these problems. 

1. Advance -delay- differential problems . Let t , t , x^, and t^ 

be fixed, t < t < t,< x, , and let an absolutely continuous 
o o 1 1 

increasing function p be defined on [x , x^] . Assume that 

p(t)<t f x <p(t ), and t n < p(x,). Let p (t) =t and 
o— o 1— 1 o 

p i+l ^ = (i=0, +1, +2,..., p^(t) £ tx Q , x ^ ] ) . Let 

U, V, A, B, and T be defined as before, and let the functions 

x and u , into respectively E n and U, be given on the intervals 

[x , t ) and (t, , x,]. Let k(t) and A(t) (te[t , t, ] ) be integers 
o o 1 1 o 1 

defined by P k(t)+1 (t) < T 0 £P k ( t )(t) and 

P-t(t) (t) - T 1 < p -£(t)-l (t) - 

(Such integers exist because p{t)-t is negative and bounded 
away from 0. We also observe that k and £ are step-f unctions) . 

For any function y on [x , x^j , let 
y(t) = (y(p k(t) (t)), y(p k{t) _ 1 (t)),... f y(p Q (t) ) ,. .. ,y(p_ t(t) (t)) , 
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f\j 

and let f(y,v,t) be defined for every t e [t ,^1, every 
y e *•••* E n (k{t) + Mt) + 1 times), and every 
v e U x...x u (k ( t) + £(t) + 1 times). 

We consider the problem of minimizing x 1 (t 1 > subject to 
the functional equation 

(1.1) x(t) = f(x(t), u(t), t) a.e. in , 

and subject to relations (0.2), (0.3), and (0.4). We also 
restrict x to be absolutely continuous on [t Q ,t^]. We 
shall show that this problem can be reduced to the "ordinary 
differential" type. 

Let k = k(t 1 ), I = £(t 1 ) + 1, Xi (t) = x(p i (t)) and 
u i (t) = u(p i (t)) (t e (p(t 1 ),t 1 ], p i (t) e [t q f t ) , 

y = (Y Y x ) and v = (v £, . . . ,vj) (y i e E n , v.^ e U) , and 

let 0 i (y/V,t) = p A (t) f (y / v, t) (t e (p(t 1 ),t 1 l, i = 0,1,..., k) . 

Let also k' be a positive integer defined by 

P k ' (t^) ^ > p^, +i(t]_), anc * let T e (p(t^) ,t^] be defined by 

P k / (x) = t Q . Let y (t) = x^/ (t) on [ x , ] and y(t) = x k , (t) 

on [p (t 1 ) ,7) . 

We now observe that our problem is equivalent to the 
"ordinary differential" problem of minimizing x o ^(t^) subject 
to the differential equations 

x^ (t) = <Jk (x (t) ,u (t) , t) a.e. in tp(t 1 ),t 1 ] (i=0 , 1 , . . . ,k y -1) 

(1.2) x, , (t) = <$>,/ (x (t) ,u(t) ,t) a.e. in [x,!,] 

y (t) = X K 1 

0 a .e . in [p (t^) , t) , 

and subject to the restrictions 
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(1. 

,3) 

x i + i (t i 

) = x i (p (t 1 ) ) 

(i = 0,1 k'-l) , 

(1. 

.4) 

(y (p(t 1 

)), x q (t 1 ) )t B t 

y (t l> = 

(1. 

. 5) 

x ± {t) e 

A (t e [p(t 1 ) 

.tjK p i (t) e lt 0 # t x l ) 

(1, 

.6) 

u^t) e 

V (p^ (t) ) (t e 

[p (t 1 ) ,t 1 3 , 



P i (t) e 




2. Staging problems . We consider the "ordinary differential" 
control problem but, instead of restricting the curve x to 
be absolutely continuous on [t Q ,t 1 ] , we require that x be 
piecewise absolutely continuous, with at most k jump discontinui- 
ties on the interval [t Q ,t ;L 3. We also require that 
x (t . +) (= lim x { t ) ) and x(t.-)(= lim x(t)) satisfy relations 

1 4--.^ 1 f<TT 


of the form (x(x^+), x{i^-)) e ^ E n x E n at 

point of discontinuity in [t Q ,t^l * The values of 

t . (i = l,...,k) can either be preassigned, or can be freely chosen. 
In the latter case, however, we assume that V(t) = U for all t. 

A typical example of such a problem is the guidance of a rocket 

that can jettison certain "stages" when they are no longer 

needed . 

We let 

T o = V T k+1 = t l' 

Xi (6) - x(t 1 + 0 ( T i + i” T o )) ( 0 <e<l, i= 0 ,l, . . . ,k) , 

x. (0) = lim x. (0) , x. (1) = lim x. (6) , 

1 e+o 1 e+i 1 

u^{0) = u(t^ + 0(t^ + ^ “ T 0 ) ) (0 < 0 < 1, i= 0 , 1, . . . ,k) . 

If the (i=0 , 1, k+1) are preassigned, our problem 

becomes one of minimizing x^(l) by a choice of an absolutely 
continuous function x = (x q , on [0,1] satisfying 

the relations 
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( 2 . 1 ) 


dx. (0) 


de 


= (T i+ i - T i )f(x i ( ®>' 


T i + e(T i + i - v* u i (e)) 


a . e . in [0,1] (i^O , 1, . . . ,k) 


(2.2) 

o 

o 

X 

x k (l) ) e B 


(2.3) 

(x i (l). 

x i + l(°)) e (i~0 ,1, ... ,k-l) 


(2.4) 

X^e) e 

A (0 <_ 0 < 1) 


(2.5) 

o 

d> 

•H 

0 

V(T i + 0(T i+1 - T o )) (O<0<1, 1=0,1,., 

. . ,k) 


If the can be freely chosen in some preassigned interval 

[a,b] , we may treat them as constant functions on [0,1], and 
adjoin the following relations: 


dx . 

jgi = 0 a.e. in [0,1] (l-0,...,k> 

a i T 0 (°> !•••! T k+1 (0) - b - 

3. Variable times, and constraints on a finite subset of [t ,t^] . 

We now modify the standard "ordinary differential" problem as 
follows : 

Let T i (i=l,..,k) be either preassigned or free to choose subject 
to the relations t < t <...< t. < t, . In addition to relations 

O—O— — K — 1 

(0.1), (0.2), (0.3), and (0.4), we require that the absolutely 

continuous function x satisfy the relation 

(x(t Q ), x(t 1 ),..., x(x k ), x(t x )) e B # , 

ft 

where B is a given set. 

We may proceed as we did for the "staging" problem, except 
that the relations (2.2) and (2.3) are replaced by 


I 
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and 


(3.1) (x Q (0), x 1 (0),..., x k (0), x k (l))e B # 

(3.2) x.^1) = x i+1 (0) (i=0,l ,...,k-l) 


Northeastern University 
Boston, Massachusetts 
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By P. M. Lion 

Assistant Professor of Aerospace and Mechanical Sciences 
Princeton University 
Princeton, New Jersey 

SUMMARY 

# $ # # $ 5fc 5jC 

The necessary conditions for an impulsive trajectory to be optimum 
can be stated in terms of Lawden' s primer vector. Recently, the primer 
vector has also been shown to have significance for non -optimal trajec- 
tories, indicating how these trajectories can be improved. This paper 
presents a simplified derivation of both results from a single viewpoint. 
In addition, a computational scheme for determining optimum n-impulse 
trajectories is suggested. 


Introduction 

The term "primer vector" was introduced by Lawden (Ref. 1) to de- 
note the three adjoint variables associated with the velocity vector on an 
optimal trajectory. Lawden derived a necessary condition for the opti- 
mality of impulsive trajectories in terms of the magnitude of this vector. 
(Optimum in this memo is defined as minimum characteristic velocity. ) 

Recently (Ref. 2), the definition of the primer vector has been ex- 
tended to non-optimal impulsive trajectories. It can then be shown that 
the primer gives a clear indication of how the original, or reference, 
trajectory can be improved; i.e. , how the reference trajectory can be 
altered so as to decrease the total characteristic velocity while still 
satisfying the boundary conditions. The two main results of (Ref. 2) 
are 

(1) the criterion for an additional impulse* Using this test indicates 
whether or not the reference trajectory can be improved by an ad- 
ditional midcourse impulse. 

(2) the transver sality condition. Using this test one can determine how 
the interior (midcourse) impulses of the reference trajectory should 
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be moved in both position and time so as to decrease the character- 
istic velocity. In addition, it is possible to determine whether 
initial and/ or final coasts will improve the trajectory. 

In this paper the results of (Ref. 1) and (Ref. 2) are rederived from 
a single viewpoint. In addition, a computational scheme for determining 
optimum n-impulse trajectories is suggested. 

Formulation and Notation 

The equations of motion are 

X = V4> (x, t) (1) 

where $ is the gravitational potential. For impulsive trajectories, it 
is assumed that the velocity vector v = (vi , vs, v 3 ) can be altered dis- 
continuous ly ; however, the position vector x - (xi , xa , X 3 ) must be 
continuous. The criterion of optimality is the sum of the magnitudes of 
the velocity increments, 


J = 2 |Av | 

k K 

The optimization problem may then be stated as follows: given an 

initial state (v , x ) and a final state <v f> x f ) , find the trajectory which 
connects these states in a given travel time, t^ , such that J is mini- 
mized. 


Assume that some trajectory T has been found which satisfies the 
boundary conditions and consider small perturbations about V . Let 
(x, v) and (x' t v 1 ) denote the state vectors on T and on the perturbed 
trajectory P ' respectively. Define 


6x(t) = x'(t) - x(t) 
6v(t) = v'(t) - v(t) 


( 2 ) 


If T and T' are sufficiently close to justify a linear analysis, then 
(6v, 6x) are, to first order, the solutions of the following variational 
equations of ( 1 ): 


lik\ 


Uv/ 



( 3 ) 
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where I is the (3x3) identity matrix and G is the ’’gravity gradient" 
matrix. The elements of G are given by 

~ a** 

®ij ~ dx. Sx. 

i J 

In second order form (3) can be written 


6x = G 6x 


(4) 


The (6 x 6) transition matrix fi(t, T) for this system can be partitioned 
into four (3x3) matrices as follows: 


ft(t, T) = 


I sill (t. T) 


Ql2 (t. 


H 


(5) 


iSZaitt, T) Ora(t, T)j 
The adjoint system to (3) is 


(M 


1 o 

■ c \ 

u j 


u 

ol 1 


where \jl and A, are 3 - vectors. In second order form this becomes 


A = gA 


( 6 ) 


identical to (4), Hence the transition matrix for (A, A) will be identical 
to (5): 



= fi(t. T) 


*<n j 

MT)j 


(7) 


It can be shown by differentiation that the identity 


A • 6v - A • 6x = constant 


( 8 ) 


holds everywhere on T, This equation is the basis for most of the 
analysis which follows. 
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Consider I\ a two impulse trajectory (or two -impulse segment of 
a multi -impulse trajectory), with impulses Av 0 at t Q and Avj at tf . 
The primer vector A is defined as the solution to system (6) which 
satisfies the following boundary conditions: 


x ( t o ) 


= X 

o 


Av 

o 



A(t f ) = A f 


I Av f | 


That is, at the endpoints of T, A has unit magnitude and is aligned with 
the velocity increment. A solution satisfying these boundary conditions 
can be found if ftia (t f , t Q ) is non -singular ; the initial value of A is 
given by 


A. = fA £ 


«V ‘o' <v n " ( v V X c 


The above definition of the primer vector is extended easily to 
multi-impulse trajectories. In such cases, the right hand side of (6) is 
different on the different segments. At each impulse point, t^ , A. is 
again defined as a unit vector in the direction of the impulse 


A V = 


Av 


k 


i Av k! 


The solutions from different arcs are, therefore, patched together so 
that A is continuous over the entire trajectory. The primer rate A 
will, in general, be discontinuous at impulse points. (On optimal tra- 
jectories, however, it will be shown that A is continuous and is ortho- 
gonal to A at interior impulses. ) 

Criterion for Additional Impulse 

Consider the two impulse trajectory T (shown schematically in 
Figure 2) which goes from x Q to x f in the prescribed transit time. 

T may be a complete trajectory or a two impulse segment of a multi- 
impulse trajectory. By Lambert’s theorem (Ref. 3) there are no other 
two impulse trajectories (in the neighborhood of T ) which satisfy these 
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boundary conditions. There is, however, a four parameter family of 
three impulse trajectories which do satisfy these conditions. 

Assume that F passes through the point x m at t = t m . The four 
parameters used to describe a neighboring three impulse trajectory 17 1 
will be the time of the midcourse impulse, t m , and the position rela- 
tive to F at this time 6x m - x'(t m ) - x(t m ) . 

r 1 is constructed as follows: The impulse (Av 0 + 6v 0 ) is applied 

at t G so that 37* will pass through x m + 6x m at t m . T* must be 
continuous in position, but not in velocity. A small midcourse impulse 
" ^ v m") is required to null the position displacement at tf(6xf=0), 
Finally, the impulse (Avf + 6vf) is applied at t^ so that the final 
velocity is matched. 

The costs on T and T ' , dropping the higher order terms, 
are as follows: 

on r ; J = | Av | + ! Av_ i 
o f 

on T' : j' = | Av + 6v 1 + | 6v + - 6v j + I Av + 6v I 
o o' m m i f f I 

The difference in cost, to first order, is 



From the definition of the primer vector, 

6 J = A * 6v + i <5v + - 6v ~ ! - A • 6v 
o o m m 1 f f 

Using (8) this becomes 

6j = - A • (6v + - 6v ") + | 6v + - 6v I 
m m m 1 m m 1 

This expression is homogeneous in (6v m + - 6v ”) . 

Denoting the magnitude of the midcourse impulse by c , 
6j = c (1 - A m • 7]) 

where n is a unit vector in the direction of (6v m + - • 


yl 


A PRIMER ON THE PRIMER 


If 6 J can be made negative, then T represents an improvement 
in cost over T . This can occur if, and only if, p(t m ) > 1 . That is, if 
p(t m ) >1 it is possible, by varying the direction of 6x m , to find a T 
such that the required midcourse impulse (6v m + - 6v m ) will point in 
the direction of . Clearly, for this choice 6J is negative. 


Therefore we have the following results: 

(a) If r is an optimum trajectory (or a segment of a multi -impulse 
optimum trajectory), it is necessary that p(t) ^ 1 for all t in the 
interval (t Qf tf) . 

(b) If p(t) > 1 for any t in the interval (t Q , tf) , then there exists a 
neighboring trajectory with an additional impulse which lowers the 
cost. To first order, the greatest improvement in cost can be 
realized by applying the midcourse impulse at the time the primer 
magnitude reaches its maximum, and in the direction of the primer 
vector. 


If the time history of the primer vector is as shown in Figure 2, then 
the necessary condition for optimality is satisfied. Figure 3 shows an 
example of a case where an additional impulse improves the trajectory. 

These conclusions are the result of a linear analysis, and, there- 
fore, effects are additive. If, then, the primer magnitude exceeds unity 
at more than one point, the reference trajectory can be improved (at 
least to first order) by adding impulses at both points. 

Finally, if the reference trajectory is two impulse, then there is no 
other (neighboring) two impulse trajectory which satisfies the boundary 
conditions. If p(t) < 1 for all t Q <t<tf, then all neighboring trajec - 
tories with more impulses actually increase the cost. In this case, 
then, we have sufficient conditions for the reference trajectory to be 
optimal. Figure 2 represents such a case. 

Trans versality 

In this section, an expression is developed which gives the differ- 
ential cost between two neighboring trajectories. This expression is the 
analog of the transver sality condition of the calculus of variations. In 
this case, however, there is wider applicability since neither trajectory 
need be an optimum. This is in distinction to the finite thrust case, and 
is a result of the fact that '’cost' 1 is incurred only at discrete points. 
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Consider the two neighboring trajectories shown in Figure 4. Both 
trajectories are initially on orbit C Q at t 0 . On T , impulses are ap- 
plied at t G and t^. T 1 , on the other hand, remains on C Q until 
ti (- t Q + dt Q ) and then impulses are applied at ti and tf . Both tra- 
jectories have terminal state corresponding to orbit Cf at tf . 

The symbol d{ • ) will be used to indicate noncontemporaneous 
variations, that is 

dx = x* (t + dt) - x(t) 

To first order the relationship between dx and 6x is 
dx = 6x + x dt . 

On r , the cost is 



and on F' 

J* = |vi + - Vi" | + |vt (v f - + 6v f ) | 


The differential cost is given by 


6 J = A • {dv + - dv ) - A * 6v (9) 

o o o f f 

where dv + = vi + (t + dt ) - v + (t ) 
o o o o o 

= 6v + + v + dt 
o o o 

and dv ~ = v dt 
o o o 

Since v is continuous (it depends only on position and time) 


dv - dv ~ = 6v 

o o o 


Substituting in (9) and using (8) 

6 J - A • 6x 
o o 

or, since 6x = dx - x * dt 
o o o o 

6j = A • dx - (A • x + ) dt (10) 

o o o o o 
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This "transver sality condition" represents the difference in cost between 
two neighboring trajectories whose initial points differ in position by dx^ 
and in time by dt Q {and whose final position and time are identical). 

Note that no considerations of optimality have been required. 

If the final point differs by (dt f , dx f ) , the proper expression is 

6 J = - A • dx f + (A f • x f + ) dt f (11) 

Equation (10) can be put in more familiar form by adding X Q • (v Q + -v Q ) = 0 

and noting that dv = v dt . Equation (10) then becomes 
6 o o o 

6j = - X • dv + X • dx + H dt (1^) 

o o o o o 

which is exactly the same form as the usual transver sality equation for 
finite thrust (optimal) trajectories. Similarly (11) becomes 

6J = A * dv f - A f • dx f - H dt f (13) 

Equations (10) and (11) are the fundamental form, however, since the 
differentials are independent. 

Final and Initial Coasts 

To test the desirability of an initial coast, the reference trajectory 
is compared with a neighboring trajectory which has been allowed to 

coast (dt > 0) in the initial orbit (Figure 4). 
o 

Using the transver sality condition (10), the difference in cost to 
first order is 

6j = X * dx - (X • x + ) dt 
o o o o o 

Substituting dx Q = x Q “ dt Q , (the superscript minus refers to increments 
along the initial orbit), this becomes 

6j = X • (x ” - x + ) dt 
o o o o 

Since dt >0, 6j will be negative if 
o 
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From the definition of the primer A Q i s parallel to Av Q . Therefore, 
the last equation implies 

- (A * A ) < 0 

o o 


or 


> 
dt 

t 


In other words, if the primer magnitude exceeds unity immediately 
after the initial impulse, an initial coast would lower the cost. Figure 5 
is an example of this. 

Similarly, it can be shown that if 


dt 


< 0 


t=t 


f 


then a final coast improves the cost. The trajectories being compared 
in this case are shown in Figure 6. For this case, it must be remem- 
bered that, to meet the time constraint, dt£ < 0 . The primer in Fig- 
ure 7 is an example. 


Figure 8 represents a trajectory which is so far from optimal that 
almost anything (additional impulse, initial coast, final coast) will im- 
prove it. 


Circular Coplanar Orbits 

In the special case of transfers between circular coplanar orbits 
the above conditions have a simple geometric interpretation. 

It is more convenient here to shift to polar coordinates. Equa- 
tion (13) for a final coast becomes 

6J = A d 0 + - H dt + < 0 
of f 

and for an initial coast, Equation (12) becomes 

6J = - A d6 ~ + H dt ~ < 0 
do o 

where A^ = A : xs - A a xi - A x xa + A s X i . 
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Let u o be the angular rate of the initial orbit and the angular 

rate of the final orbit. Then the above conditions can be written 


Final coast (dt^. < 0): H/A^ > 

Initial coast (dt > 0): H/A- < u 
o / o o 


(14) 


Now consider Figure 9. Here contours of constant J are plotted 
against the prescribed central angle (cp) and trip time (T). These con- 
tours are closed about the minimum J, or Hohmann, trajectory. The 
value of J increases going outward from this point. 

Along the contours, since J is constant, 

6 J = dcp - H dt = 0 

(Note: = |j-, H = - -|^ ) 

The slope of the contours is thus given by 

m = ^t = H / A e 


To interpret equations (14) geometrically consider Figure 10. The 
original two-impulse contours are shown as broken lines. Straight lines 
with slope u 0 and (J f have been drawn tangent to these contours. The 
solid lines represent contours of constant J when initial and final coasts 
are considered. In region A, where m < u> 0 , initial coasts represent 
an improvement. In region C, where m > final coasts are an im- 
provement. 

For example, if a transfer corresponding to point Pi is required, 
it is cheaper to coast initially through angle Acp (which takes time AT) 
and then perform the transfer corresponding to Fh . 

Also note that the set of points on the cp - T plane which can be 
reached for the minimum (Hohmann) cost is a wedge (cross-hatched on 
the figure). Any point in this wedge can be reached by a unique com- 
bination of initial and final coasts plus the 1 80° transfer . Other points 
may need either an initial or final coast, but not both. 
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Moving Interior Impulses 

Consider the two three impulse segments shown in Figure 11. The 
differential cost between these two trajectories can be derived from 
Equations (10) and (11) and is given by 


6 J = [A + • dx - {A + . x + )dt ] - [A " • dx - {A ~ . x ")dt ] 

m mmmm m m m m m 

= {A + - A ) • dx + (H + - H')dt (15) 

m m m m ' ' 

where the equation 


(H - H") = - (A 


+ 

m 


-A 


X-) 


(16) 


has been used, dx and dt in Equation (15) can be chosen independ- 
ently. 

The following conclusions can then be drawn 

(a) If the reference trajectory is an optimum, then it is necessary that 

A + = A - and H + = H~ 

Since H is constant on any segment, it is therefore constant along 
the entire trajectory. 

If these two functions are continuous, then 

A * (x ^ - x ) = 0 
m m m 

Since A m is aligned with the velocity impulse this last equation 
becomes 

A -A =0 

m m 
dp 

or — = 0 at impulse points. 

(b) If T is not an optimum, then a neighboring trajectory with lower 
cost can be found by choosing 

a*m= - e 

, (IV) 

dt = - € [H - H"J 
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If G is "sufficiently small, " then the three impulse trajectory 
which passes through x + £ dx m at t m + £ dt m will represent 
an improvement over F. Therefore, Equation (15) tells us how 
interior impulses should be moved in position and time so as to 
reduce characteristic velocity. 

A Gradient Scheme for Optimum Multi -Impulse Trajectories 

In this section, a technique is suggested for determining optimum 
n-impulse trajectories (where n is open) starting with the two impulse, - 
or "Lambert, " solution (or any other nominal trajectory). 

This technique is based upon two results: (1) the criterion for an 

additional impulse, which tells when an additional impulse should be 
added, and (2) the transver sality equation as developed in Equation (15) 
which tells how interior impulses should be moved. 

A necessary part of this technique is a subroutine which solves 
Lambert's problem; i.e., given both position and velocity at two differ- 
ent times, find the trajectory which connects them. Also, it is desirable 
to have a subroutine which computes the transition matrix Q(t, T) 
(Equation (5)). For the inverse square field, the formulation of the 
Lambert problems by Pines (Ref. 4) and the transition matrix routine 
by Goodyear (Ref. 5) represent elegant answers to these needs. The 
formulation of both is done in the "universal variables" and thus is valid 
without modification for all conic sections. 

The iteration procedure is as follows: given the position and velocity 

at two terminals plus transit time, determine from the Lambert sub- 
routine the two impulse trajectory which connects them. Imposing the 
appropriate boundary conditions on the primer, determine the time his- 
tory of p(t). If the primer magnitude appears as in Figure 2, then the 
two impulse trajectory is at least a local optimum. If the primer mag- 
nitude rises above 1. 0, then a third impulse must be added. Let t m 
indicate the time when the primer reaches a maximum. Using the 
boundary conditions 6x(t Q ) = 0, 6x(t f ) = 0 , it can be shown that for any 
trajectory T 1 passing through x(t m ) + 6x m , 

(6v + - 6v ) = A 6x 
mm m 

where A =' Q S s (t , t ) Qi 3 _1 (t , tJ-Qaaft , t ) a _1 (t , t ) 

mi mf mo mo 


100 



A PRIMER ON THE PRIMER 


For maximum improvement, (Sv,^ “ ^ v m ) should be parallel to 
. Therefore, assuming A is nonsingular, choose 

5x = e A" 1 A 
m m 

where e is a small constant to insure that the range of linearity is not 
violated. 

Two Lambert problems are then solved. 

(1) Connecting (x , t ) with (x + 6x , t ) 

o o m m m 

(2) Connecting (x + 6x , t ) with (x„, O 

° m mm f f 

If C is small enough, then this three impulse trajectory will represent 
an improvement. In all probability the three impulse trajectory is not 
an optimum. For instance, the plot of primer magnitude vs time may 
look as shown in Figure 12. In this event, after the A. and H before 
and after the impulses are calculated, the following corrections are 
made as given by Equation (17): 

dx = - e (A + \ ) 

m mm 

dt = - £ (H + - H - ) 
m 


Again two Lambert problems are calculated and the process repeated 
until 


I H + - H' | < r, 


< c 


where r| and £ are preassigned tolerances. 

If at any point in the iteration, the primer magnitude becomes 
greater than 1 . 0 at some point, then an additional impulse is added. 
From that point on, three Lambert problems must be solved at each 
iteration. The interior impulses are then moved in the same manner 
as above. In principle, there is no limit to the number of impulses 
which can be handled by this formulation. 
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This technique, which is actually a gradient computation, exhibits 
the properties of first order techniques in general: guaranteed improve- 
ment on each iteration but convergence slowing as the minimum is ap- 
proached. 

The method is presently being programmed by the ASMAR group at 
Princeton. 
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SUMMARY 


The following principle is exploited to obtain five linearly independent 
solutions of the variational equations for Keplerian motion. The principle: 

If a system of differential equations is invariant under a continuous and 
differentiable group of transformations, it is possible in general, by 
differentiations only, to write down a number of linearly independent 
solutions of the variational equations equal to the number of independent 
parameters of the group. In the Keplerian case there is, however, a removable 
singularity occurring when the motion is circular, 

A sixth solution of the variational equations is given by differentiation 
with respect to the eccentricity e, or rather with respect to cos _ le in 
the elliptic case and with respect to cosh"^e in the hyperbolic case, A 
more complicated function of e can be used in the parabolic case, a parabola 
being thought of as the limit of a family of non parabolic conics. 

Numerous formulas and identities are written out explicitly for 
manipulations in the elliptic case, A variation of the Lagrange method for 
integrating non-homogeneous linear differential equations, especially adapted 
for systems of second order, is developed and applied to the elliptic case of 
Keplerian motion. 
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Finally it is shown how the Keplerian differential equations are invariant 
under a group of transformations which, in general, change the eccentricity 
provided that one allows the independent variable to undergo a differential 
transformation . 


INTRODUCTION 

The purpose of this paper is to collect in as explicit a form as possible 
the formulas necessary for handling the perturbation of Keplerian motion in 
rectangular coordinates. The emphasis is on elliptic motion although brief 
mention is also made of both hyperbolic and parabolic motion. 

The proposed method is one of successive approximations based on the 
repeated integration (by quadratures) of a series of non-homogeneous variation- 
al equations. It is more explicit and automatic than the Brouwer method (Cf. 
Brouwer and Clemence, Methods of Celestial Mechanics, Academic Press 1961, 
pages 398 to 414) , but in other respects is much the same thing. The Brouwer 
method itself contains little originality, since, for example, his method of 
integrating the non-homogeneous variational equations is due in principle to 
Lagrange. The merits of the present paper are therefore more in the realm of 
explicitness and exposition than of originality, although it contains a number 
of formulas and theorems that we have not found elsewhere. 

The method of Lagrange is based on a prior knowledge of a complete set of 
solutions of the homogeneous variational equations, The method of finding 
these is based on a group theoretical principle which has contributed to the 
title of the paper and which is explained as follows: 

Consider a system of differential equations of the form 
(1) x* - f(t, x) , 

where x and f are n-vectors, t is the independent variable, f is of 
class C* , and x* represents either the first or the second derivative of x 
with respect to t (or, for that matter, the result of any linear homogeneous 
differential operator with constant coefficients acting on x) . The homogeneous 
variational system based on a given solution x(t) is, by definition, the 
system. 


L* = A(tK , 

where A(t) is an n * n-natrix, namely the jacobian matrix of the components 
of f with respect to the components of x, with x replaced by x(t). Now, 
if x(t) can be imbedded in a one parameter family of solutions, say x(t, p) , 
in such wise that x(t, p^) ■ x(t) , it is both well known and obvious that a 

solution of the variational equations is given by £ ■ (9x/3p) with p set 
equal to Group theory is of value in finding a way to imbed the given 
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solution in a family of solutions depending on one or more parameters. 

Suppose that (1) is invariant under a group of transformations whose 
equations are of the form 

x» - h(t f x, p) , t' - P(t , x, p) . 

Assume that p Q is the parameter value corresponding to the identity, so that 
*» p Q ) = x and P(t, x, p Q ) = t. Then any fixed solution of (1), gives 
rise to a solution x(t, p) such that 

x(P(t, x(t), p), p) = h(t , x(t), p) 

a formula which will lead to an explicit expression for x(t', p) provided 
that the equation t' * P(t, x(t), p) can be solved for t in terms of t' 
and p. In any case, it is clear from the above identity (by setting p = p Q ) 

that x(t, Pq) * x(t) » Several very simple examples of the utility of these 

considerations are given in the next Section in connection with the variationa 
equations of Keplerian motion. 


I. SOLUTIONS OF THE VARIATIONAL EQUATIONS 


The equations for Keplerian motion, with proper choice of the units, may 
be written in the form 


d 2 x 

dt 2 

ix. 

dt 2 


T"2 2 2,3/2 
[x +y *z ] 


T2 2 2,3/2 
[x +y * z ] 


d 2 z 


dt 


r 2 2 2,3/2 

[x *y*z] 


The equations of variation based on a given solution of these Keplerian 
equations, say, x » <£(t) , y * 0(t) , z * w(t) , take the form 


2 2 2 2 

d £ (24> -ii> -w )£ + 3<j>4jn ♦ 34>u>c 

ut 2 [*V ^ 2 ] 5/2 
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2 2 2 2 
U n a + (2^ )n + 

,* 2 * 772“ " 2 27572 

at [4> + i|' + w] 


a C 3o>4? ^ + Siui^n + (2u*~“ 4)*~- d'"' ) £ 

at 2 “ [*vJ ] 5/2 


We propose, using the group-theoretical principle explained above to find a 
complete set of six solutions of the variational equations based on any 
solution of a one parameter family of solutions of the keplerian equations, say 

x ■ <t>(X, t), y « t), z = oj(X, t) , 

where the parameter X is assumed to be independent of the group parameters. 

The six solutions are exhibited as the six columns in the following 3 x tv- 
mat rix 

$> - (3/2) t J l 0 w -ijj <p x ~ 

W* * y - (3/2) t J J -u 0 $ 0 

w - (3/2) t u w <J/ -<J> 0 

where the dots denote differentiation with respect to t. 

The solution in the first column comes from what we shall call the scale 
group. Namely the Keplerian equations are invariant under the group of trans- 
formations x’ ■ p *x, y* *> p *y, z * * p ^z, t 1 « p ^ "t in which the 
identity occurs when p ■ 1, Thus a solution of the Keplerian equations is 

x * p ♦(X, p 3 ^t) , y ■ p iJj(A, p 5/ ^ 2 t), z * p uj( A , p" 3/,2 t). Differentiating 

with respect to p and then setting p ■ 1, we get the elements in the first 

column of the matrix W*. 

The second column comes from the autonomous group, the equations for which 
are x* « x, y* * y, z* * z, f » t+p. Thus a solution of the Keplerian 
equations is x * 4>(A, t+p), y * ^(A, t+p), z * u>(A, t+p). Differentiating 
with respect to p and then setting p * 0 (which corresponds to the 
identity of the present group) , we get the elements in the second column of W* 

The third, fourth, and fifth columns come from the rotation groups. For 
instance the Keplerian equations are invariant under the group x’ - x, 
y' * y cos p - z sin p, z‘ » y sin p + z cos p, t* » t, that is the group 

of rotations about the x-axis. Thus a solution of the Keplerian equations is 
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x » $ y = V cos p - w sin p f z * sin p + oj cos p. Differentiating with 

respect to p and setting p - 0 # we get the third column of W*. The 
fourth and fifth columns are obtained in a similar way by consideration of 
rotation groups about the y- and z-axes. 

Finally the sixth column is certainly a solution of the variational 
equations because it consists merely of the derivatives of $, i|/, and w 

with respect to the parameter X. 

It is known that Keplerian motion always takes place in a plane. Hence 
there is no essential loss of generality in choosing the coordinate system in 
such a way that the given Keplerian motion takes place in the xy-plane. This 
amounts to taking u> = 0 ; thus greatly simplifying the matrix W* by 
annihilating five of its elements (in addition to the three elements which 
are already zero). Furthermore, it is obvious, both from the columns of W* 
and from the variational equations themselves, that with w = 0, the 
variational system splits into two systems one involving only £ and n and 
the other involving only 5. The second of these systems is relatively trivial, 
and hence from this point on we confine most of our attention to the first of 
these systems, namely the system 


( 2 ) 


d 2 £ £ ♦ 34>vpn 

dt 2 ‘ ( *V) 5/2 

d 2 q * (24 j 2 -4> 2 )q 

dt 2 * C *V ) 5/2 


where <f> and ^ satisfy the two dimensional Keplerian equations, 
(3) + * T- 5 T7T » 5 


- £ 

772 2.3/2 


7v) 3/2 


The most general elliptic case can be obtained (by the operation of a 
scale transformation, a rotation, and a translation of t) from the following 
one-parameter family of solutions: 


$ ■ - cos X * cos B 


(4) 


4* « sin X sin 6 

where 0 is to be regarded as a function of t through 
(5) t ■ 6 - cos X sin 6 . 

An elementary calculation shows that (3) is satisfied by 


the Kepler equation 


C 4) and (5). The 
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quantity 0 is known as the eccentric anomaly, The eccentricity of the orbit 
is cos A, at least, if 0 < A n/2. 

Similarly the hyperbolic case can be handled by taking 

$ * - cosh A ♦ cosh 0 

gj ■ sinh A sinh 0 

where 0 is defined as a function of t by the equation 

t * cosh A sinh 0-6. 

The most general parabolic case can be treated by taking 

* - j (i - e 2 ) 

ij; ■ 0 

where 0 is defined as a function of t by means of the equation 


The parabolic case differs from the other two, hecause al 1 parabolas are 
obtainable from this one standard parabola by the operation of the three 
groups, There is no parameter A, independent of the group parameters, 
enabling us to write the last column of the matrix W*. This situation can be 
rectified by finding a family of conics consisting of ellipses and/$r hyper- 
bolas which, as the parameter A of the family tends to a certain value, say, 
0, converges to our standard parabola. Such a family of ellipses converging 
to our parabola as A ■+ 0 is the following: 

2 

4> ■ - cot A esc A + esc A cos ( 0 sin A) 

^ * esc A sin(0 sin A) 

2 2 

t ■ 0 esc A - cot A esc A sin(0 sin A) . 

It is not within the scope of this paper to discuss any further either the 
parabolic or hyperbolic orbits. 


II, MISCELLANEOUS FORMULAS FOR ELLIPTIC MOTION 

We restrict most of our attention to the elliptic case, Since we are 
also restricting attention to the planar case, the equations of variation will 
have only four linearly independent solutions. They are exhibited as the four 
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columns in the following 2 x 4-matrix, 

<J>-(3/2)t<j> 

Y - (3/2)t J J <p 


( 6 ) 


W 


This matrix is, of course, derivable from the matrix W* by omitting the last 
row and the third and fourth columns. In writing down the explicit expressions 
for these solutions, it must be remembered that 6 is regarded as a function 
of both t and X, We readily find from (5) that 


(7) 


and 


(8) 


Thus , 

we find from 

(9) 

♦ (t) 

(10) 

i(t) 

(11) 

♦x<*> 

(12) 

* x ct) 


at 


cos X cos 6 


90 - sin X sin 0 


9X 1 - cos X coc 


sin 0 (9 6/9t) 


- sin 


ij>(t) ■ + sin X cos 9 ( 9 9/9 1 ) 


1 - cos X cos 0 
sin X cos 


1 - cos X cos 


sin X sin 


1 - cos X cos 0 


^ (t) * cos X sin 0 + sin X cos 6 (90/3X) 


cos X sin 0 - 


2 

sin X cos 6 sin 
1 - cos X cos 0 


These formulas are sufficient to enable us to write down at once all the 
elements of W. It will also be necessary for us to have W, the derivative 
of W with respect to t. For this purpose we record the following formulas 
obtained by differentiating formulas (9)- (10) with respect to t. 


(13) 


i(t) 


cos X - cos 0 

(1 - cos X cos 0) 3 
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( 14 ) 


*(t) 


- sin A sin 6 
(1 - cos A cos 6)" 


Before differentiating $ x and ^ it is convenient first to observe 
that they may also be expressed in the forms 


(15) 

(16) 
Hence 

(17) 


<P X (t ) - sin X - i(t) iHt) 

^(t) ■ cos X sin 0 - ^(t) i(t) 

J x (t) - - i(t) vKt) - J(t) i(t) . 


To find i x (t) first note from (4) that 

(18) <t> 2 + ^ a ( 1 - cos X cos 0)“ 

Differentiating this, we get 


4>j> + i>i « (1 - cos X cos 0 ) ( cos X sin 8)(99/9t) - cos X sin 6 


So (16) may be written 

(19) 0 x (t) ■ * W - <M» * 

Hence 

(20) J' x (t) “ J 2 ♦ 

The <£ and may always be eliminated with the help of (3). 

Some other formulas worth noting and easily derivable from the preceding 
are the following 

(21) <}>(t) 4 > x (t) ♦ iHt)* x (t) - 4 >(t) sin A 

(22) $(t)i(t) - J(t)*(t) a sin A 

(23) , 2 , - (i 2 ♦ J 2 ) * 1 . 

c * 2 * ») iu 
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If we write the variational system (2) as a system of four linear 
differential equations of the first order 


whe re A 


dC . 

Ut ' 

dn 

dt 

dt 

dn , 

dt 


2<T 


0 5 ♦ 0 n ♦ U ♦ 0 n 

o e ♦ o n + o | ♦ lj 

A £ ♦ B n ♦ 0 i ♦ 0 J 

C £ ♦ D n ♦ 0 £ + 0 n 

2 


* 


(4 


, 2 ^ 5/2 


B * + C 


3# 


7T2 ^ .2.5/2 
U * ^ ) 


and 


p . » 2 ? .. 

cV ♦ 


it is seen at once that the trace of the matrix of coefficients on the right 
i*- zero. Hence, by a known theorem on linear differential equations, we know 


that the determinant of the matrix 


ix /W\ is a 

(ft) 


constant. It can therefore be 


evaluated in a simple manner by evaluating it at t * 0, From the above 
formulas it is easy to write down the following matrix. 


(24) 


W(0) 

W(0) 


1 - cos X 


l,sin X ^ 
2 l l-cosX J 


Evidently 


4 : <o> ) 

V'(0)/ 


0 

sin X 


1 - cosX 

1 

2 

(1-cosX) 

0 


0 

1 - cos * 

- 5 * n * 

1 - cos X 


1 - cos X 


1, sin X . 

■ r h-c -s si " 0 


sin X 


- 1 

1-cosX 


sin X 
1-cosX 

- 1 


sin X 
0 

0 

- 1 


1 - cos X 


[-( 1/2) (1-cosX) ] [-cosX(l-cosX)‘*] * ♦ 2“ A cos X 


(1-cosX) 

o-l 


1 - cos X 


- sin X 
2 ( 1-cosX) 
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This shows that our four solutions are not linearly independent if X ■ it/ 2. 
In fact this becomes obvious anyway; for, with X * tt/ 2, 6 = t, 4>(t) ■ cos t, 
4»(t) * sin t, $(t) - - <Ht), ^(t) ■ , so that the second and third 

columns of W are identical. The case X ■ tt / 2 is the case of zero 
eccentricity, corresponding to circular motion, The singularity occurring at 
X ■ tr/2 is, however, easily removable, in accordance with the following 
theorem. 


Theorem 1 , The variational equations admit the solution 

„ .. i * ip - <t> 

r m " ■ ■ , n * T 

cos X * cos X 


and this solution has a removable singularity at X * rr/2. 


Proof, The first statement is an obvious consequence of the fact that the 
(£ ,n ) of the theorem is a linear combination of the second and third columns 
of the matrix W of (6), 


As for the second statement, we find by an elementary calculation based 
on (4), (9), and (10) that 


<t> ♦ 


cos X cos X L 1 


♦ sinX sinO] 


-sine ♦ sinX sin9 - sinX cosX sine cos9 
(1 - cosX cos eY cos X 


, - sin 6 > . 1 - sinX ' 

M - cosX cose^ ^ cos X ) 


sinX sine cos8 
(X - cosX cose) 


n 


£ Z 1 m 1 

cos X cos X 


[■ 


sinX cose 
1 - cosX cose 


(- cosX + cos6) ] 


sinX cose - cose + cosX + cosX cos e - cos X cose 
(1 - cosX cos0)cosX 


- C 


cos e 


' 1 - cos X cos6 J ' cos X 


;)( 


1 - sinX 


0 ♦ ( 


1 + cos Z 6 - cosX cosi 
l - cosX cose 


'■) . 
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.. 1-sinX cos X „ , 

N0W cos X * 1+s T n T ^ 0 as x 7T ' 2, Ve als0 have 6 ^ t * Hence in the 

limit as X -► ir/2, we have 

£ ~ - sin t cos t * - -- sin 2t 
2 3 1 

n e 1 + cos t * + — cos 2t . 

It is readily verified that these equations satisfy the variational 
equations when $ * cos t and « sin t. They afford a solution which may 
be used to replace either the second or third column of W to preserve a 
complete set of linearly independent solutions in the case X * */2. 

The inverse of the matrix in (24) is important for future purposes. It 
was calculated in the usual routine way, and we record the result here. 



2 

(1-cosX) 2 


0 


2sinX 

1-cosX 


0 


tan X 


(1-cosX) 2 
cos X 


0 


- 1 

(1-cosX) cosX 


- tan X 


0 


-sin X 
( 1-cosX) 2 


0 


0 



III, APPLICATION OF NON-HOMOGENEOUS LINEAR EQUATIONS 
TO THE PERTURBATION OF ELLIPTIC MOTION 


Perturbation of Keplerian elliptic motion generally requires the 
solution of a non-linear system, which, with suitable choice of coordinates, 
may be put in the form of the following three equations. 


(26) 


d 2 C 

dt 2 


(2T 


JlIL. 


[* 


TW 




f 
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(27) 


- On 

2 2 2 5/2 ^ 

dr [<T * 1» ] ' 


(28) 


dfl 

dt 


[<T 


,2,3/2 


in which the non-linearities in C# n # and occur only in f, r, and h 
and are small compared to the linear terms. Solutions are required to satisfy 
given initial conditions or more complicated boundary conditions. A useful 
method is that of successive approximations in which an approximate solution 
is inserted for n, C in f, g, h, which accordingly are temporarily 

regarded as known functions of t. We then are apt to get a better approxima- 
tion by integrating the resulting non-homogeneous linear system, and then the 
process is repeated. The process, when infinitely repeated, converges to an 
exact solution under suitable conditions. In this paper, however, we are not 
concerned with convergence questions, but rather with efficient methods for 
solving the non-homogeneous linear system consisting of (26) , (27) , and (28) 
when f, g, and h are regarded as known functions of t. 

Notice that (28) can be solved independently of (26) and (27), Indeed 
it admits the general solution 


t 

(29) r, ■ a<f>(t) ♦ b\Kt) ♦ esc A j [^ ( t ) <j>(s) - $(t) y(s) ]h(s) ds , 

0 

where a and b are constants of integration. The reader nay verify this 
statement a posteriori with the help of (3) and (22). 

We wish to get a similar result for (26) and (27), which are not 
decoupled. In the next section we develop a general theory of non-homogeneous 
linear systems consisting of n second order differential equations. When 
n * 1, this theory leads to the trivial result (29). When n « 2, it yields 
the not so trivial analogous result for the system consisting of (26) and (27). 


IV. A MODIFICATION OF LAC RAN CL *S THEORY 
OF VARIATION OF PARAMETERS 

We wish to develop here a modification of the formula of Lagrange for 
solving a non-homogeneous system of linear differential equations in terms of 
a complete set of solutions of the corresponding homogeneous system. This 
formula is commonly referred to as the "Lagrange variation of parameters" 
formula. We wish to present a version which can be applied directly to a 
system of n equations of the second order in n unknowns without the 
necessity of rewriting the system as one of 2n equations of the first order 
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in 2n unknowns. The system of interest in our immediate applications is of 
the form 


(30) ^-4 * A(t) x ♦ f(t) 
dt" 

where x is an n- vector whose components are the n unknown functions, 

A(t) is an n * n-matrix whose elements are known continuous functions of the 
independent variable t, and f is an n- vector whose components are known 
continuous functions of t. It became evident however that the more general 
equation (35) below could be treated equally well. The n x n-matrix B(t) 
will be assumed to be of class C'. 

Theorem 2 . Suppose that 0(t, s) is an n * n-matrix whose elements are 
functions of class C" of the two variables t and s. Suppose furthermore 
that 

(31) 0(t, S) - A(t)0(t, s) ♦ B(t)0 (t , s) , 

(32) 0 (s , S) - 0 

(33) G(s, s) * I , the n * n-identity matrix, 

and let 

(34) 

Then 

(35) x(t) * A(t)x(t) ♦ B(t)x(t) ♦ f(t) 

(36) x(t 0 ) » 0 

(37) i(t Q ) - 0 , 

where the dot on 0 represents differentiation with respect to its first 
argument and the dot on x(t) denotes differentiation with respect to t. 

Proof , Evidently (36) follows at once from (34), Differentiating (34) 
we have 

t 

x(t) = / fl(t, s) f(s)ds ♦ 0(t, t)f(t). 


Because of (32) this reduces to 


t 

X(t) * / #(t, s) f (s)ds . 
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(38) 


x(t) - / 0(t, s)f(s)ds . 


w 0 


and then (37) is seen to hold. Now differentiating (38), we get 


X(t) - / 0(t, s) f (s)ds * 0(t, t)f(t) . 


0 

Hence from (31) and (33), we get 


*Ct) = / [A(t)0(t, s) ♦ B(t)0(t, s)]f(s)ds ♦ f(t) . 


This can also be written 

t 

x(t) - A(t) / 


0(t, 


s)f(s)ds ♦ B(t) 


t 

/ 0(t, s)f(s)ds ♦ f(t). 



Hence from (34) and (38) we see that (35) must hold. 

The existence and uniqueness of the matrix 0(t, s) , with the properties 
described in the hypothesis of the above theorem, are self evident from the 
existence, uniqueness, and continuity theorems for linear differential 
equations. It may be calculated from any n x 2n-matrix solution X of the 
equation 

(39) X(t) - A(t)X(t) ♦ B(t ) X(t) , 

provided that 



Since the columns of 0(t, s) are solutions of x * A(t)x ♦ B(t)x, 
they must be linear combinations of the 2n columns of X(t) , with 
coefficients which are functions of s. This amounts to saying that there 
exists an n * 2n-matrix Y(s), such that 

(41) 0(t, s) * X(t)Y'(s) (Y* - transpose of Y) . 

Y(s) is now determined by (32) and (33), which may also be written 


(42) X(t)Y»(t) - 0 

(43) X(t)Y'(t) - I , 
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which last two equations, because of (40), are just sufficient for the 
determination of Y, 

Theorem 3 . The matrix Y(t) described above satisfies the system 

(44) Y(t) - (A'(t) - B'(t))Y(t) - B ' (t) Y (t) . 

Proof , Differentiating (42), we obtain 

(45) XY» ♦ X*' - 0 
Hence from (43) , we have 

(46) (i) 5CY* « I and (ii) X*' - - I , 

Differentiating (46i) and using (39), we have 

(47) [AX + BX] Y ' ♦ X?' - 0 , 

Since XY' ■ 0 and XY' * I, we find from (47) that 

(48) - - B . 

Differentiating this last relation and again using (39), we have 

(49) [AX + BX]Y» ♦ XY* ■ - B 

But, since by (46) xt* ■ - I and by (48) xt' ■ - B, we find that (49) 
becomes - A - B 2 * XY' ■ - § so that 

(50) - A ♦ B 2 - § 

Multiplying (46i) on the right by A - B we have 

(51) XY' (A - B) * A - B , 
while from (48) we have 

(52) jCY'B « - B 2 . 

Subtracting (51) from (50) and adding (52) we get 

(53) *[Y' - Y ' (A - B) ♦ $’B] • 0 

Differentiating (46ii), we have + XY* « 0, so that it follows 

from (48) that 

(54) X?' - B . 
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We multiply (42) on the right by (A - B) thus obtaining 

(55) XY 1 (A - B) - 0. 

Using, once again, ( 46 i i ) , multiplying on the right by B, we see that 

(56) XY 1 B » - B . 

Adding (54) and (56) and subtracting (55) , we obtain 

(57) X [ Y * - Y'(A - B) + Y'B] - 0 . 

It follows from (40) , (53) , and (57) that 

Y* - Y» (A - B) ♦ Y’B ■ 0 . 

Hence, taking the transpose, Y - (A* - B')Y * B'Y ■ 0, which is obviously 
equivalent to (44), as we wished to prove. 

The system (35) is said to be self-adjoint if 

(58) A' (t) - B 1 * A(t) and B'(t) » - B(t) , 

In the self adjoint case Y(t) and X(t) satisfy the same differential 
system, namely (39). 


Theorem 4 . Y(t) satisfies the initial conditions 


(59) 


X (0) Y ' (0) - 0 
X (0 ) Y * (0) - I 


X(0)Y*(0) * - I 
X(0)Y'(0) - - B (0) , 


and, since (40) holds, Y(t) is uniquely determined by (59) and (44). 

Proof, We get (59) from (42), (46), and (48), by setting t * 0, Because 
of (40) with t ■ 0, the initial values of Y and Y are uniquely 
determined by (59), and Theorem 4 follows from existence and uniqueness 
theorems for differential equations. 

/o -I \ 

and J = I I where B» * B(0). Then, if 

V V 

the system is self adjoint, Y(t) may be found from the formula 
(60) Y(t) . X(t) Ly" 1 J' (L 0 ‘V , L Q . 1.(0). 


Theorem 5. Let L(t) 


r X(t)^ 

L X(t)j 


and 
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(61) 

Proof. Let H 


0(t, s) = 



so that 


X(t) L 0 _1 XCS)' . 

1C = (Y\ Y*), H, I. and 


.1 


arc all 2n * 2n- 


matrices. Conditions (59) may be written in the abbreviated form 


subscript 0 means that the matrix in question is evaluated for 
t = 0. Since the system is self-adjoint, we see from (44) and (58) that 
Y(t) * A(t)Y(t) + B(t)Y(t). Prom (39) it follows that the columns of Y 
must be linear combinations of the columns of X. In other words there exists 
a 2n x 2n-matrix Q of constants such that 

(63) Y(t) = X(t)Q 

and hence Y(t) = X(t)H, The last two written equations may be written more 
briefly as H(T) *= L(t)Q. In particular, on setting t =* 0, we have 


Therefore Q * L Q 1 . Inserting this into (63), we find that 

Y(t) = XCt)^” 1 Il 0 , while from (62) we have 11* 0 = L 0 _1 J, Hence 

H () * J, ( L o 1 )** inserting this expression for H in the last formula 

for Y (t) , we see that (60) has been proved. And (61) now follows from (60) 
and (41). 

In order to apply Theorem 5 to the system (30), we take B(t) = 0, so 
that the condition (58) for self adjointness reduces to the requirement that 
A(t) be symmetric. It actually is symmetric not only in the case of the 
variational equations of Keplerian motion but also in the case of the 
variational equations connected with any solution of a conservative holonomic 
dynamical svstem. In fact the matrix A in such a case is merely the 
negative of the Hessian matrix of the potential function with the given 
solution inserted, assuming that the coordinates are t chosen in such a manner 
that the kinetic energy is given in the form (1/2) (x • x) . 


(62) 

where the 


V. THE 0- MATRIX POR ELLIPTIC MOTION 

We now proceed to the consideration of the example alluded to at the end 
of Section III, restricting attention to the elliptic case, The matrix X of 
Section IV is now, in this special case, to be represented by the matrix W 
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of Section II. Hence the matrix L(0) 


is the matrix 


U(0) 


and the inverse 


['V(f)) 


L(0)~ is accordingly the matrix of formula (25), Using this and various 
other formulas of Section II, the four elements of the matrix Q(t, s) were 
calculated from the formula (61). Ke give only a brief outline of the tedicus 
but straight forward calculation by mentioning only the following three inter- 
mediate formulas, First, by carrying out the indicated matrix multiplication, 
we have 



0 

♦ 2 

0 

0 

L(O)' 1 J (L(0) * 1 ) ' . 

-2 

0 

0 

tanA 


0 

0 

0 

-secA 


0 

-tanA 

secA 

0 


Secondly by using the identities of Section II, it is not difficult to 
establish the two following identities: 


^ secA + i tanA ■ - M esc A 
i|/ tanA - $ secA * 1 ♦ esc A, 

which are useful to eliminate the singularity at A ■ tt/2. 

Let 0^(t, s) be the element in the i row and j column of 
0(t, s), Then ve present our final results in the form: 

Q n (t,s)«3(s-t)i(t)i(s)+2[<Ht) i(s)-J(t)*(s) ]-[^(t)i(t)$ x (s)-^(s)i(s)4i x (t) ]csc A 

e 21 (t,s) ■ 3(s-t)J(t) J(s) + 2[y/(t)4»(s) - i(t)$(s)] 

♦ f4>(t)i(t)4> A (s) ♦ ^(s)i(s)i^ x (t) ] esc A ♦ 4 > x (s) 

0 12 (t,s) * 3(s-t)i(t)J(s) ♦ 2[^(t)J(s) - J(t)i|;(s)] 

- (^(t) i(t)^(s) + 0(-s)i(s)0 A (t) J esc A - <t> x (t) 
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0 22 (t,s) - 3(s-t)i(t)i(s) + 2[*(t)i(s) - i(t)*(s)] 

♦ [0(t)i(t)^ A (s) - ♦(s)l(s)<» x (t)] CSC X + |> x (s) - 0 x (t)] . 


VI. ECCENTRICITY CHANGING TRANSFORMATIONS 

In Section I we used the fact that the Keplerian differential equations 
were invariant under the scale, autonomous, and rotation transformation groups, 
These make it possible to pass from any trajectory to any other trajectory 
with the same eccentricity. The question naturally arises as to whether the 
equations are also invariant under a transformation group enabling a passage 
from any trajectory to another trajectory with different eccentricity. The 
answer appears to be in the affirmative provided that we allow a differential 
transformation on the time t instead of the simple transformation of the 
form t* *= P(t, x) as indicated in the introduction. This proviso causes 
difficulty in the use of such transformations, but nevertheless we present 
the theory in the hope that an application for it may be found at some future 
time. 


Because of the rotation groups it is obviously permissible to confine 
attention to the planar problem, and indeed to orbits whose major axes lie 
along the x-axis of coordinates. We thus find it sufficient to consider the 
transformation T(a, b) , depending on the parameters a and b, defined 
by the equations, 


(64) 


, / 2.2 / 2 2 

C * x / a + b + a / x + y 

n * b y b f 0 . 


These transformations are easily seen to form a commutative group, since 

- 1 * 2-1 

T(0, 1) is the identity transformation, and since T(a, b) = T(- ab ,b ), 

and T(a, b) T(a, 6) * T(a /a 2 ♦ B 2 + a /a 2 ♦ b 2 , bB) . 

These transformations all leave the origin invariant, They also leave 
the x-axis invariant. But, it is possible to pass from any point (x, y) 
with y + 0 to any other point (C, n) with n i 0. We have only to take 


r / 2 2 /_ 2 2 

(65) a - 5 -£ * l l - * ■'E « n , b - 2. 

y 2 y 

It is also possible to pass from any point (x, 0) with x t 0 to any other 
point (Ci 0), with C i 0 and sgn C ■ sgn x. 
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Since it is proved easily from (64) that 


2 2 2 2 2 2 

(66) / C + n 4- = /(a" + h ) (x" + y“) + ax 

2 2 .1 

it is seen (with a + 0) that the ellipse with eccentricity |a|(a“ + b 41 ") 7, 

focus at the origin, and directrix x = a is carried by the transformation 

into the unit circle £*" ♦ n* ■ 1. It can be seen further, restricting 
attention to conics with focus at the origin and directrices parallel to the 

y-axis, that a conic with occentricity c is carried over by T(a, b) * into 
a conic with eccentricity 

e /a“~ + b*~ ♦ a 
V a * b ♦ a e 


It follows that ellipses are carried into ellipses, parabolas into parabolas, 
and hyperbolas into hyperbolas. Except in the case of the narabola the 
eccentricity is always changed by any transformation of the grou*'* for which 
a t 0, Any ellipse can be carried into any other ellipse, an)* parabola into 
any other parabola, and any hyperbola into any other hyperbola. That the 
transformation carries any conic with focus at the origin (but with directrix 
not necessarily parallel to the y-axis) into another conic with focus at the 
origin is nore difficult to see; but it is one of t lie conclusions that can be 
drawn from the following discussion of the Keplerian differential equations. 

We now examine the effect of the transformation (64), where a and b 
are regarded ns constants while x ■ x(t) and y * y(t) are functions 

satisfying the Keplcrian equations x = - x r"\ y = - y r where 
2 2 2 

r * x ♦ y 4- . Thus £ and n are also to be regarded as functions of t. 

')')■> 

We also introduce p^ « £“ ♦ n*' and another variable a defined uniquely by 
the requi rements that da/dt * p/r and that o ® 0 when t 3 0, Thus o 
is a function of t, but we can also solve for t in terms of a and use 
the latter for the independent variable. In this way £ and n can be 
regarded as functions of o and we write £' « d£/do * (d£/dt) (r/p) , with 
similar formulas for n, Wc wish to find differential equations (similar to 
the Keplerian differential equations) to be satisfied by £(o) and n(a). 

It is known that xy - xy ■ c is independent of t (and hence also of 
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• - 1 • “1 

a). This is also true of cy - x r = A and cx ♦ y r = B. 

A straight forward calculation shows that the quantity y ■ - C'n is 

also independent of t (and hence of a). In fact it shows that y = be. 

2 2 1/2 

Still further, if we introduce the quantities, Q - (a ♦ b ) - aA, 

M . bB> an d n . A(a 2 ♦ b 2 ) 1/2 , all of which are independent of t and a, 
our straight forward calculation shows that yV + Qno 1 = M and 


vq* - oco' 1 = N. Multiplying the first of these two equr'ions by n and the 
second *by £, we find after a subtraction that y(Cn f - Vn) = Qp * - Mn, 


Hence y 2 - Qp - N£ - Mn. It is also easy to prove that 


V 2 U' 2 ♦ n' 2 ) * (M 2 ♦ N 2 ) ♦ 2Qp‘ 1 (N5 

0 7 -12 

-= or ♦ x ) ♦ 2Qp (y 

. m 2 ♦ N 2 - Q 2 ♦ &L. 

0 

Thus we have established the following 


- Mn) ♦ Q 2 

- Qp) ♦ Q 2 


two relations 


Cn’ - C'n = be 


— + n'^) * Qp * * (M^ * N - Q )/(2y ) . 


If we now differentiate these two equations with respect to a and solve for 

-3 

the second derivatives C" and n", we find that - - QCo » 


n »» * . Onp" 3 . Since Q is a constant, these equations are already in 
Keolerian form; so that it is clear that the point U(o), n(o)) describes 
an orbit which must be a conic section, and, if it is a non-degenerate conic, 
one focus must be at the origin. 


Nevertheless Q need not be unity. Hence 

invariance of the original equations x * - x r 
another modification of the independent variable 

wise that df/do = Q^ 4 ", with t * 0 when o * 


in order to achieve the 

^ y s . v r -3 , we make 
. bet t be defined in such 

0. Then using x as the 
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independent variable, we find that d 2 £/dx 2 - - £p~ 3 and d 2 n/di 2 * - n p‘ 3 , 
where the relation between i and t is evidently 

Although A is a constant of the motion, it may change from one motion to 
another. Hence it has to be considered as an abbreviation for 

(xy - xy) y - x(x" ♦ y 2 )“ 1/<2 . 

We conclude therefore that there exists a function U(x, y, x, y, a, b) 
such that the transformation 


r / 2 ,2 , 2 2 

£ “ x /a + b * a/x * y 
n * b y b t 0 

dx/dt ■ U(x # y, x, y, a, b) 


transforms the Keplerian differential system x = - x(x 2 + y") -3 ^ 2 , 
y * - y(x 2 ♦ y 2 ) 3/ ^ 2 into the Keplerian system, 

d 2 £ £ d"n n 

77 I * “T " “ ~ * 

dx p di p 

There is a question about the sign of the quantity Q; for, if it is 

negative, our change in the time variable through dt/do » would be 

imaginary. We have not investigated this situation, except to note that Q 
is necessarily positive when the motion (x(t), y(t)) is either elliptic or 
parabolic. For it is well known that the constants A and B, introduced 
above, are related to the eccentricity e through the equation 


Hence | A [ < e ^ 1 in either the elliptic or parabolic case, and since we 
must always take b ^ 0, we sec immediately that 

<! - (a 2 ♦ b 2 ) 1/2 - aA > (1 . 
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SUMMARY 

A new proof of the Sundman inequality with refinements, and its utility in 
the introduction of a monotonical ly increasing angular variable for the 
Lagrangian inertial radius in the n-body problem. A discussion of the possible 
amount of information derivable from these considerations by studying them in 
the light of the integrable two-body problem, 


INTRODUCTION 

In this paper we give a proof of the well known Sundman inequality which 
is far superior in generality, precision, and elegance to any which we have 
hitherto seen. The additional terms to be introduced in order to turn the 
inequality into an equality are explicitly exhibited in a reasonably simple 
form. Thus, by including some or all of these terms, we achieve the superior 
precision noted above. All these results are based on some simple theorems in 
the field of classical vector analysis. In fact the essence of the Sundman 
inequality appears in a more general setting than that of the n-body problem. 

The last sentence needs to be emphasized despite the fact that the only 
applications of the inequality have been to the theory of the n-body problem, 
and despite also the fact that in this very paper we have ventured (through the 
Sundman inequality) a further modest contribution to this theory. Namely we 
have shown how to introduce a monotonically increasing "angular" variable 9(t) 
with certain interesting properties. Upon setting X - R cos 0 and 
Y ■ R sin 0, where R is the Lagrangian inertial radius, we investigate 
differential equations to be satisfied b; X and Y. These equations are 
shown to contain enough information essentially to solve the problem completely 
when n * 2, Naturally the situation for n > 2 is very different, since the 
more complicated cases need much more information for their complete solution, 
But it may be reasonably hoped that the same information which solves the two- 
body problem completely may at least yield some interesting qualitative results 
for the n-body problem. 
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I. PRELIMINARY THEOREMS IN VECTOR ANALYSIS 


In the sequel the inner or scalar product of two 3-vectors a and b is 
denoted either by (ab), without a dot, or by a * b, without the parentheses. 
The outer or vector product will be denoted by (a * b). 

Theorem 1. Let a, b, c, d be any four vectors in three dimensional space, 
Then the scalar U(a, b, c, d) * (aa) (dd) - 2(ac)(bd) - 2(ab)(cd) ♦ 2Cad)(bc) ♦ 
(bb) (cc) is not negative. 

Proof , The theorem is certainly true if d * 0; for then W * (bb) (cc) > 0. 
lienee we may fix attention on the case d + 0. Then, regarding b, c, and d 
as constant vectors, we seek to minimize W by varying a. Evidently the 
gradient of W with respect to the components of a is the vector 

3W/3a - 2a (dd) - 2c(bd) - 2b(cd) ♦ 2d(bc) , 

while the hessian matrix of W with respect to the components of a is the 
positive definite matrix, 2(dd)I, where I is the 3*3 identity matrix. It 
follows that W assumes its minimum value when a is such that it makes the 
gradiant of IV vanish, namely, when 

a * (dd)" 1 [b (cd) ♦ c(bd) - d(bc)j . 

If we substitute this expression for a in the formula for IV, we find by a 
routine calculation that the minimum value of W turns out to be 


- 1 

(bb) 

(be) 

(bd) 

(dd) 

(cb) 

(cc) 

(cd) 


(db) 

(dc) 

(dd) 


We have here the determinant of a certain type of symmetric matrix which is well 
known to be positive if the vectors b, c, and d are linearly independent 
and, otherwise, it is positive semi-definite. Cf. Hardy, Littlewood, and 
Polya, Inequalities, Cambridge University Press (1934), p, 16. In either case, 
the minimum value of the determinant (and hcncc of IV) must be non-negative 
and thus the theorem is proved, 

Theorem 2 . Let u i» U 2 » *•*» u n be n vectors in ordinary three dimensional 
space and let v^, v^ , ,,« , v^ be another set of n vectors. Then the 
following formula is identically satisfied 
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2 n 


( Z (u u )( Z (v v )-( Z (u v )) -( Z (u.xv.)M Z (u.xv ))si z W(u. ,u.,v. ,vj 

i-1 1 1 i-1 1 1 i-1 1 1 i-1 11 i«l 1 1 2 i # j ci 1 J 1 7 

where the W function is defined by the formula of Theorem 1. 

Proof, Evidently the left member of the last formula may be written in the 
form 

n 

“ [(u.u i )(v.v ) - Cu.vKu.v ) . (u X V ) • (u X v )] 

■ L A j j 1 1 J J 1 1 j J 

By a known formula of vector analysis, we know that 

(U. * V.) • ( Uj * Vj) - ( u i u j )(v i v j ) ' Cu i v j )(v i u j J > 

while a simple manipulation of the summation indices i and j shows that 

" n 1 1 

1 (u i u i )(v i v i ) * 1 Cj Cu i u i )(v i v i } + “ tu.u )(v v )] 

i,j»l J J i f j-l * 1 1 J J " J J 1 1 

Hence the left member of our formula can be transformed into 

n 1 

. 2 • (Vp^jV * (u i u j )(v i v j ) * C^v.Hu.v.) ♦ i.( U .u.)(v.v.)] 

which, by the formula for K, given in Theorem 1, can be written 
n 1 

i.j-i 2 1 j 1 y 


and this finishes the proof. 

As a corollary to Theorem 1 and 2, we obtain the inequality 

n n n 2 n n 

(1) c r. (U.U.))C Z ( V - V • ) ) > ( Z (u,v.)) ♦ ( Z (u. X v.)) • ( Z (u. X V.)) 

i-1 1 i-1 1 1 i-1 1 1 i-1 11 i-1 1 1 

no matter what the vectors u. , u . v,, .... v may be. 

1 n* 1* * n 1 

Theorem 3 . The scalar W(a, b, c, d) of Theorem 1 vanishes whenever the 
vectors c and d are proportional to the vectors a and b, in the sense 
that there exists a scalar k such that b * ka and d = kc. 
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The proof of this theorem is left to the reader since it is an almost 
obvious consequence of the formula for W(a, b , c, d) , 

II. THE SUN OMAN INEQUALITY 

In the application to the n-body problem with the n masses m^, m,, , 
m n # let the position of the mass at time t be determined relative to 

a given coordinate frame by the nosition vector r. with components x. , y. , z.. 

i l* ' l * i 

The velocity of im is then r^ with components x^ , y^ , the dot 

denoting differentiation with respect to t. The Lagrange inertial radius R 
is defined by 


( 2 ) 


U ■ I m.fr.r.) , 

. , i v l i' 

i**l 


The kinetic energy T is given by 


(3) 


T - A l m. (r . r . ) , 

2 . , l v l i' 

i-l 


The angular momentum vector <J> is evidently 


(4) 


l m.(r, * r,) 

i-i 1 1 1 


Also, by differentiating (2) we have 


(5) 


RR 


If now we let u. - ni. 1/2 r i and v,. - m. A/2 r. # we see that (2), (3), (4) and 
(5) become 


T. m.fr.r.) , 

i-i 1 1 1 


I (u. u. ) 

i.l 1 1 


2T 


l (v.v.) 
•,ii 
i*l 
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= Z (U. x V. ) 

w 1 1 


RR » Z (u. v. ) 

,ii 

i*l 


Thus, substituting in our inequality (1), we obtain one form of the famous 
Sundman inequality, namely. 


( 6 ) 


2 2 *2 

2R T _> R^ \C ♦ (<fr . $) 


We may also write this in the form 


(7) R 2 * f2 * 3 = 2T 

R 

1/2 

where f ■ (<J> • <f>) is the magnitude of the angular momentum and Q is a 
positive scalar. Actually, by using the identity of Theorem 2 instead of' the 
inequality (1), easily derive the following explicit formula for Q 


Q ■ 7 


( 8 ) 


W(Ui ’ V V 


1 Z 

2 i.j-1 


IV ( m . 


1/2 


1/2 1/2 * 
m, r . , m. ' r. . 

3 3 i 


n. 1/2 r.) 
3 3 


So far we have made no use of the equations of motion for the n-body 
problem. The Sundman formulas (6) and (7), with Q defined by (8), hold for 
any system of n masses moving around in space with arbitrary velocities. As 
a consequence of the equations of motion, we know that we have ten first 
integrals, from which it follows that 


f * constant 

and that there exists a constant k (the negative of the energy constant) such 
that 

j2p2 

(1°) — * 2K - 2T . 

dt 

This last relation is one form of the well known Lagrange identity. On using 
(10) to eliminate the T in (7), we obtain 
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(11) R 2 ♦ 2RR + 2K >, f 2 R~ 2 , 

which is the form of Sundman's inequality used in our previous report entitled 
Rejection to Infinity in the Problem of Three Bodies when the Total Energy is 
Negative , 


III. AN ANGULAR VARIABLE I : OR R 

It is convenient to write (7) in the form 

• 2 p 2 

(12) R * 2 * 21 

k 

2 ^ 2 x* 

where p ■ f" ♦ () > f , We then define a monotoni cal ly increasing function 
6(t) by means of 


(13) 


0(t) * 0 n ♦ / p(s) R(s)’"ds 
0 


-1 


where 6^ is an arbitrar>' constant. Hence R(t) 0(t) * p{t) R(t) so that 
(12) becomes 

(14) R 2 ♦ R 2 0 2 ■ 2T 


while, of course, we also have 


(15) 

Now let 

(lb) 


r e » p > f > o 

X(t) * R cos 6 
Y(t) - R sin 0 


Then we easily find that 


Y 2 Y 2 n 2 

X ♦ i *11 


(17) 


•2 *2 *2 2 *2 
X ♦ Y - R ♦RO- 2T 


XY-XY-R 0 
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Hence 


(18) 

From (10) and (12), we have 


i tX 2 ♦ Y 2 ) . T and X Y - X Y » p 


•2 2 -2 d 2 R 2 , 

R ♦ p R » — y - 2h, 

dt 


where h * -K is the energy constant. From this we find that 

2 


(Rft* . 2h R ♦ - 2p . 


d_ 2 

dt 


Integrating we get 


RR 2 • 2h R ♦ p 2 R’ 1 « c(t) 


where 


c(t) « R 0 R q 2 - 2hR Q ^ p Q 2 R 0 ' i * / 2R(s) -1 p(s) ?(s)ds 

From (15) we now obtain 

R 2 + R 2 0 2 = c R* 1 + 2h . 


So from (17) we have 


• ? •*> -1 
X ♦ Y‘ . c R ♦ 2h 


X Y - X Y * p 


Differentiating these last two equations we find that 
(19a) 

Z R Z 

(19b) _YX + XY=p. 

In deriving (19a) we, of course, use the obvious fact that c s 2R 1 p p, We 
now solve (19) for X and Y, using the fact, derived from (17) that 
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RR * X \ + YY. In this manner we obtain the pair of equations 


( 20 ) 


-u X X p p 


R 3 R R it 


y . mJ. . Lilt. . Li 

R 3 R° R R R 


where 


1 

M - J C 


1 [D K 2 . 7 h R . JLi ♦ r p.l s I 

2 l * 0 K 0 * n 0 R 1 > Q R(s) 


Us . 


Thus, if p is small relative to R and R, then X and Y very nearly 
satisfy the equations for Keplerian motion. 

We now show that the information contained in (20), in the case of the two 
body problem, is equivalent to the reduction of the latter to Keplerian motion. 


Taking the origin at the center of gravity of the system, we have 
(21) m^r^ + n^r., - 0, ” 0 # 

Hence the vectors m^^ 2 and r 2 are P ro P ort ^ ona ^ t0 the vectors 

m^^ 2 r^ and m ? 1//2 r 9 . Hence from Theorem 3 and equation (8) we see that 

Q ■ 0; and therefore p ■ f, which is a constant. Hence equations (2) reduce 
to 


(22) 




where y is the constant 


•* * y l'V\> - 2 h R 0 * 


f* R. 


So far, we know only that y is a constant along each motion; but it might 
differ for different motions. We show now, however, that the latter is not the 
case. Indeed, we can show that y is a simple function of and only. 

To this end we write 


R • ? ? _2 

u - j [R ♦ f fc R - 2h] , 
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dropping the subscript 0 , which is not important, since the right hand member 
of the last written equation is almost obviously a constant along each motion. 

- 1 - 1 • 

Now by (15) we have f R » p R « R 6 , so that 

w - | [R 2 ♦ R 2 8 2 - 2h] . 


It now follows from (17) that \i » R(T - h) , Since h is the total energy of 
the system, h - T must be the potential energy of the system, which is, of 

course, - nunup -1 , where p is the distance between m. and m . It follows 
that 

(23) y ■ m 1 m 2 R p " 1 

By definition of R and by (21) we have 


(24) R 2 - m l | r ^ | 2 ♦ m 2 | r 2 1 2 = [rj 2 (m 1 /m 2 )(m 1 ♦ m 2 ) . 
Similarly 

(25) p » Irjl ♦ I r 2 1 - | Tj | m ^ 1 (nij + m 2 ) . 
Hence, from (23), we have 



If we orient our coordinate frame of reference so that the positive z-axis 
has the direction of the angular momentum vector, the motion takes place in the 
xy-plane and we obtain the equations 

(m + m )c (m + in ) n 

(26) ^ 3 , n 3 

P P 

for the relative motion of the two bodies, where £ and n are the coordinates 

2 2 1/2 

of E»j (say) relative to m 2 . The ratio ofp 3 (£+n) to R is seen 

1/2 1/2 

from (24) and (25) to be (m^ ♦ m ) /(m^) , By a discussion of the angular 

momentum it may be shown that tan”*(n/0 - 9 is a constant; and it is also not 
hard to prove that, if the axes are oriented so that this constant is zero, then 
the ratio C to X (and of n to Y ) oust be a constant and eeual to the 
ratio of p to R. It is then easy to see that (22) and (26) are equivalent. 
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SUMMARY 

The intuitive aspects of approximate decoupling of an n-body problem as 
exemplified in the solar system. Formalization of approximate decoupling in 
terms of the limiting values of certain parameters contained in the equations. 
Partial decoupling versus complete decoupling. Details in the approximate 
decoupling of an n-body problem into a k-body problem (k < n) and an 
(n-k+l)-body problem in the situation where k of the bodies are relatively 
close to each other compared to the other mutual distances and where also the 
k bodies have small masses compared to the other (n-k) bodies. The Hill 
equations for the motion of the moon as a special case of partial decoupling 
in the above situation with n = 3 and k = 2 . 

Theory of so-called quasi-first integrals which appear in the partially 
decoupled system and which are derived from first integrals of the unreduced 
system. Examples in the reduced and restricted problems of three bodies. 

A method for appraising the validity of an approximate decoupling (either 
partial or complete) based on the use of Lipschitz constants. The hyperbolic 
cosine appraisal for comparing the solutions of two systems of second order 
equations . 

There is a problem in applying this appiaisal to the n-body equations 
because of the collision singularities. If these singularities are excluded b 
restricting attention to a region in configuration space in which each mutual 
distance between pairs of bodies is not less than a fixed positive number 
assigned to the pair in question, the resulting region is not convex; and so 
there is still a problem in the estimation of the Lipschitz constant in terms 
of hounds for partial derivatives. This problem is solved for the appropriate 
functions occurring in the n-body pro!) lem by proving that in this special 
case the non-convexity of the region may be ignored. 
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INTRODUCTION 

One of the remarkable features of an n-body problem, of the type afforded 
by the solar system, is the ease with which the system can be approximately 
decoupled. Thus the motions of the planets about the sun are usually treated 
as perturbed Keplerian motions, as are also the motions of satellites about a 
planet. This approximate decoupling is possible when some of the masses are 
very small compared to others and/or when some of the mutual distances between 
the bodies are very large compared to the other distances. 

If we introduce parameters into the system in a suitable way, the ensemble 
of the decoupled systems may be regarded as the limiting case of the original 
n-body system when one or more of these parameters approach certain limiting 
values. For instance, a three body problem in which one of the masses is very 
large compared with the other two is partially decoupled into one Keplerian 
motion and one restricted (or reduced) problem of three bodies when one of the 
smaller masses approaches zero; and it is completely decoupled into two 
independent Keplerian motions when both of the two smaller masses approach zero. 
This is the type of approximate decoupling observed in planetary theory. Another 
type of decoupling occurs in linear theory where the distance between the two 
smaller bodies is small relative to their distances from the largest mass. 

Again we may take one of the parameters to be the smallest mass and we get the 
same partial decoupling as before, when this parameter tends to zero. But, for 
the other parameter, we take something which tends to zero with the ratios of 
the smallest distance to the two larger distances, and simultaneously modifies 
the time scale. There are several ways of doing this; but, if it is done 
properly, the three body problem will again be approximately decoupled into two 
Keplerian motions, one for the motion of the intermediate mass (the earth, say) 
about the largest mass (the sun) and the other for the motion of the smallest 
mass (the moon) about the intermediate mass (the earth). But the time intervals 
for the validity of these approximations may be vastly different. Considering 
only the situation where the two above Keplerian motions are elliptic, the 
approximations may be reasonably valid in each case for approximately the same 
number of periods, but the period associated with the smallest mass may.be but 
a small fraction of the period associated with the intermediate mass. 

In this paper we wish first to consider a method for the approximate 
partial decoupling of the n-body problem into an (n-k+1) -body problem and a 
modified k-body problem (k<n) when the k bodies are relatively close to 
each other compared to the other mutual distances. We also assume that these 
k bodies are small compared to the other n-k bodies. When n=3, k*2, this 

modified k-body problem is the problem formulated by the ilill equations, at 
least if the (n-k«- 1) -body problem (i. e. in this case, the 2-body problem) is 
given a circular solution. By introducing a second parameter and proceeding 
again to a limiting case we may achieve a complete approximate decoupling. 

Secondly we shall initiate a general theory of so-called quasi-first 
integrals which arise, in the partially decoupled system, from the first 
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integrals of the original system when the parameter takes on its limiting value, 
say 0. 

Thirdly, we give a general but rather crude method for estimating the error 
consequent upon using the decoupled or partially decoupled systems in place of 
the original system. 


I. DECOUPLING OF THE n-BODY PROBLEM 
1,1, Partial Decoupling 

The original equations of motion are written as follows in terms of 
position vectors q for the n-bodies (with masses •»«, m^) : 

(1) m q. - U i - 1, .... k 


( 2 ) 


a ■ k+1, 


where the dots represent differentiations with respect to the time t, where 
the subscript q’s in (1) and (2) indicate the gradient of U with respect to 
the relevant q, and where U itself is defined by the following formulas 

U « U* * V + W 


(3) 


m. m. 

*JL 


i«j |q j - q i 


n k 

V * l l 

a«*k+l i*l 


K ‘ q i 


1 * 


W * t 


8 * k+1, , , . , n. 


a<$ ^ 

Setting p ■ * m 2 + , , , + m^ f the center of gravity of these k bodies is 

given by 

(4) 


u 1 2) m.q 


j-1 


V) 


Letting r. » - q^, r a B % - V we see that r j “ r i * ' q i * 
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r - r. * q - q. , r rt - r ■ q_ - q , Hence the formulas for U , V, W remain 
a l l a $ a n a 

valid when the q’s are replaced by the r f s. It is also clear that 
k 

I m.r. * I m. (q . - q_) * I m.q. - I m.q rt * I m.q. - yq * 0 by (4). Hence 

... l l l l i n (r l l i 1^0 i^i p 

1*1 u 


(5) 


Z m. r . * 0 . 

i.i 1 1 


.. .. .. .. - 1 
Evidently - m.q 0 - m i q i - nuy I m.q. 

j-1 J J 

-I k 

* U - m.y Z U 
r. l . . r. 
i J-1 J 

It is easy to see by direct calculation that 
k . n 

(6) Z U * 0 and Z W * 0 . 

£*1 T t B*k+1 r B 


Hence 

(7) 


J i * U ♦ V - m.y 1 I V 
* r. r. i . . r. 

ii J-1 J 


Similarly, we find that 


(8) 


-1 


m r * V + W -my IV 
a a r r a . . r. 

a a j*l j 


The equations of the n-body problem are easily seen to be invariant under 
a transformation which multiplies each distance by a parameter s provided that 

each mass is correspondingly multiplied by s 3 . Hence, if r ^ +1 # •••# r n are 

large compared to r^, r^, and if one wishes to exaggerate the comparative 

largeness of the former to the latter, it would appear desirable to multiply 

each mass by s 3 and each of the vectors r^, •••• r n s » t>ut * cave 

Xj# . . untouched. One then examines the effect of allowing s to 

approach infinity. Carrying out these modifications on (8), and making use of 
the formulas of (3) for V and W, we obtain 
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. k s 6 m.m (r. - sr ) s 6 m m (sr. - sr ) 

» s 4 ? . r i-2_i -SL. ♦ £ “ 8 s 


o a . . i 1 3 

i=l r. - sr 
1 l a 1 


8j<a 


l Sr 3 - Sr J' 


v g n 6 J 

3*1 6* k *l kj * sr 6 | 3 


Dividing by s m and then letting s ■+ ®, we obtain 
a 

yr 


(9) 


„ r _ l Vb 


I 3 8*a |r B - rj 3 B-k*l |r g | 3 


These are precisely the heliocentric equations of notion for n-k+1 bodies, 
namely the n-k bodies 1 \ +1 » m n referred to a hypothetical sun with mass 

y * + ... + m^, placed at the center of gravity of the bodies m^ , •••» 

nij. . Equations (9) thus approximately describe the notion of the n-k bodies 

m k+l* m n anti the center mass the k bodies m^, . .., m^, while 

neglecting [ ^ | , . . . , | r k | in comparison with |r k+1 |, |r |, but not 

neglecting the total attraction of .... on the other n-k bodies. 

If we also wish to neglect the masses m^, , , . , m, in comparison with 
«.», m^, we follow the procedure as above except that we do not introduce 

the factor s' 5 in connection with each of the k first masses, but only with 
the remaining masses. We then find, on letting s *•> that 


( 10 ) 


|r B - r 


rj 


z 

8=k+l 


which, of course, is the same as (9), except that the first term is omitted on 
the right hand side. 


We now investigate the behavior of equations (7) under these operations, 
neglecting m^, ..., in comparison with m^^, •••* m n as we ^ 35 

r l* *** # r k * n com P ar i son with r n * The equations may be written 

after the iili. equation is divided by itu in the following manner: 


147 


APPROXIMATE DECOUPLING IN THE n-BODY PROBLEM 


m. (r. - r. ) n m (r - r. ) 

J— i ii * t -2-2 i- - y- 


J*i I r j - rj 3 


a=k + 1 r - r . 


k n m.m (r - r.) 
y r z 1 a W - 
j-1 a=k + 1 |r a - r. I 


Hence multiplying r a by s (a *» k + 1, . n) and m by s and then 
letting s ■+ <», we get 


m. (r . - r. ) 
r. - Z -ii- 1 ’ 


(ID 


1 |r. - r i 


T * 


n * V sr a - V -1 


lim [ Z 
s-~> a-k + 1 I sr - r. I 


k n m.s m (sr - r.) 

Z Z i a - 

1*1 a°k+l Isr - r.| 

J 1 a j 


In order to evaluate the limit s ■+ 00 , we introduce o ■ 1/s* We then 
find that 


s m (sr - r . ) 
a ot i 


. k m . s m (sr - r . ) 

-i . j « ..q iL 


L 

j-1 


, m (r - or.) . k m.m (r - or.) 
-1 r ex'* a V -1 „ 3 cT a J , 

» 0 [ u l 

I r - or. I j-1 I r - or 

1 a l J a j 1 


The quantity in the bracket is now expanded in a power series in o « It is 

k 

obvious, in virtue of u - I m . , that the constant term is zero, while the 

j-1 3 

coefficient of o is readily computed to be 


m r . 3m r (r • r . ) 

a l a a v a l 


In arriving at this result we use (5). It follows that (11) may be written in 
the form 


(12) r. - E 


m . (r , - r . ) 

_2_J ii 


1 l*j - T t \ a - k+1 


m r. 3m r (r • r . ) 

, a l a a a l 

(- r + f ) * 

i _ i 3 i _ I 5 
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1.2. The Hill Example 


We now indicate the manner in which the Hill equations are special cases of 
equations (12), We thus assume n » 3, k * 2, so that k + 1 * n «= 3, We also 
restrict attention to the planar problem. Equation (10) reduces to 


(X3) 


since the only value that a and B are allowed to take on is 3, This is, of 
course, just the system for Keplerian motion. Suppose we take * 1 and 

consider the particular solution x^ * cos t, y^ = sin t. So | | * 1. 

Hence equations (12) become 


m 2 (x 2 -x 1 ) 

X 1 - 77 7 7072 - x i t3 co st C*1 cos 1 * X! sin t) 

[(x -X ) + (y 3 -y,) ] 

(14) 

m 2 ( y 2 -yi) 

y ! " 77 7 — : 727172 - y ! + 3 sin tc X 1 cos 1 + y i sin 

[(x 2 -x x ) ♦ ( y 2 - y 1 ) ] 


m 1 C x 1 ' x 2 ^ 

x = s * _ - x + 3 cos t (x_ cos t + y. sin t) 

2 t(x r x 2 ) 2 ♦ ( yi -y 2 ) 2 ] 3/2 fc 2 2 

(15) 

"i 1 Cy 1 -y 2 ) 

y - T -572 - y 2 ♦ 3 sin t (x cos t * y 2 sin t) . 

[(x 1 -x 2 ) + Cy r y 2 } 3 


From (5), we have in addition m.x^ * m 2 x 2 * 0 and m^y^ + m 2 y2 = 0* 30 that 

(15) and (14) are not independent of each other. Setting x * x^ - x 2 and 
y - yi - y 2 , we find on subtraction of (15) from (17) that 

-(m 1 *m 2 )x 

x * — 2 — — y 1 - x ♦ 3 cos t (x cos t + y s in t) 

[x +y r 

(16) 

-(m 1 +m 2 )y 

y * — Tj — - 2 - j_ - y ♦ 3 sin t (x cos t + y sin t) . 

[*♦/]' 
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Setting z * x ♦ iy, we can write (16) in the more compact form 


(17) 


-0Vm 2 )z . it , 

z m ■ — -5 z ♦ 3 e R(z e ) . 


ur 


-it 


We introduce rotating coordinates at this stage by putting C ■ z e , so that 


z * ; e 


h z - (t ♦ic)e lt , i * (E ♦ 2i t ■ c)e lt , and |z| - |c|. 


it 


Substituting in (17) and suppressing a factor e , we get 


-(m ♦m )c 

c + 2i c - C * 7 ; * 3R(C) 


Hence, setting C * £ + i n and separating real and pure inaginary parts, we 
find that 


C - 2n 


-(m 1 + m 2 )e 
[c 2 * n 2 ] 3/2 


♦ 34 


-(raj + m 2 )n 
n + 2 t - — ^ 071 

tr ♦ n 2 r 

which are Hill's equations. They appear in the more standard form 

x 


(18) 


x - * ’ 3X - T 2 2,3/2 

(x +y ) ' 


y * 2x ■ - ; i hjn 

(x +y ) 


if we take £ * (n^ + m 2 ) 1//3 x and n ■ (m^ ♦ m 2 ) 1//5 y. The present x and y 
are, of course, different from the x and y of (16). 


1.3. Complete Decoupling 

In carrying out the complete decoupling we observe that, if the k bodies 
are very close to each other compared to their distances from the other n-k 
bodies, their attraction (per unit of mass) on each other is much greater than 
on the other n-k bodies. This will cause greater relative accelerations of 
the k bodies than of the others. In order to exaggerate this effect we 
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introduce the small parameter X in such wise as to change the time scale as 
well as to multiply r^ . .., r^ and simultaneously to render those terras on 

the right of (12) which involve the other r's to be very snail compared with 

the other terms. This is achieved by multiplying r * * • * * * by and t 

3 

by A , and leaving r k+i » ...» and the masses untouched. When (12) is 
thus modified, we find that 


r 4 1. ■ x " 4 z i '■ 


m.(r - r ) 


i r j - r i 


A 2 l 
a«k+l 


m r. 3m r (r • r.) 
a l . a a v a i\ 
t * . *; J • 


First multiplying by A and then letting A ■+ 0 we get the complete 
decoupling with (12) replaced by 


(19) 


m, (r . - r. ) 

e -*-2 — ii 

i* Ip - r.| 3 


1, ..., k. 


which are precisely the equations for the k-body problem. 


Notice that the same type of reduction occurs if we take the Hill equations 
(18) in rotatingcoordinates. If we have multiply x and y by A 2 and t 

by A 3 , these equations (17) take the modified form (after multiplying through 

by A 4 ) 


* - 2A 3 y 


3A^x - 


(X* 


2,3/2 

y ) 


y + 2A x * - 


(x 


yV /2 


which reduce in the limit A -► 0 to the equations for Keplerian motion. 

It does not matter in such approximations whether rotating or non-rotating 
coordinates are used. Correspondingly the resulting approximations can be 
expected to be valid only over time intervals that are short relative to the 
period of the rotation although they could be long relative to the period (say) 
of the Keplerian elliptic motions representing motions (in the Hill case) in 
which m^ and m^ are very close to each other. 

All of this is not in the least surprising, since the same change in time 
scale applied to the equation (10) modified the latter merely by introducing a 
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factor X‘ b on the left. Hence, on multiplying by X^ and then allowing A 
to approach zero, we get 

(20) r - 0 , a * k + 1, . . . , n 

a 

as the limiting form of the other decoupled system. The time interval for a 
valid approximation of (12) by (ID) would presumably be comparable to the time 
interval for a valid approximation of (10) by (20). 


1.4. Mathematical Interpretation 

The above intuitive discussion, about the "introduction of parameters" and 
the "exaggeration" of certain effects by allowing the parameters to approach 
their limits, may have had the desired effect of minimizing the complication of 
formulas; but it has resulted in a logically inconsistent notation; and it may 
possibly have baffled the reader in other respects as well. In order to 
clarify this situation we offer the following mathematical interpretation of 
the main result of this Section. 


Consider the n - body problem with the n masses denoted by m^ , m^ , ..*« 

T T T 7 

m. , s m, , . s m. s m. ~ , ..., s m . Let the position vectors of these n 
k k+1 * k*2’ k« 3 * n 1 

bodies relative to the center of gravity of the first k of them be denoted 
respectively by A^r^, ..., sr k+l* ■ **» Sr n ant ^ ^ enotet * 

by X 3 t. We regard s and X as constant (scalar) parameters. In this rather 
unfamiliar notation the usual equations for the n-body problem (taking the 
gravitational constant to be unity) may be written out. Thus the r's, 
considered as functions of t, are found to satisfy the system, 


( 21 ) 


dt fc 


Z 


m . (r . -r. ) 

J- ^ \... 

|r.-r.| 3 
1 1 i 1 


n 

Z 

a«k*l 


4 3 , 2 , 

X s m (sr -X r. ) 
a a l 



-1 

U 


k 

Z 


j-1 


n (sr -X^r.) 

r 2 2_ 

a=k + l | sr - X r . | 

1 a y 
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(22) 


2 . * 1.6 , , 2 . 2,6 , 

dr k s X m . (X r.-sr ) s X Tn n (sr 0 -sr ) 

a „ J J „ 8 v 8 cr 

— =- - 1 s — i \ + Z 5 

dt j-1 l^j-* r a r Ma |sr B -sr a | 3 

. 2 . ,2 . 

k n s m.m.fsr^-X r.) 

- xV 1 i r ■ ^£-2-^ 


j-1 S»k*l |x 2 r.-sr g | 3 


where y * * ... + m^. Letting s -► ®, equations (21) take the form 

n , mr. 3mr(r • r. ) 

♦ l x 6 (- -2-i_ ♦ — £L_SL_2_ L) . 


d 2 r. 

(23) — ■= Z 


dt^ jfi ! x-j - | 3 a-k+1 |rj 3 I | 5 


and equations (22) take the form 
(24) 


d 2 r 


r, ft m (r ft -r ) 

Ci B £ £ 8 8 a 

dt A S^a I r e - r I 3 B-k+l I r . 1 3 


n m r 

£ -Mr) 


8 


B 1 


Equations (23) and (24) are, of course, mere modifications of equations (12) 
and (10) respectively. Namely they contain the parameter X, since the time is 

here denoted by X 3 t instead of t as in (12) and (10), Partial decoupling 
has been achieved in the limit s -*■ », since the equations (24) are independent 
of r^, But the equations (23) for X ^ 0, still contain all n of 

the unknown vectors. Complete decoupling only occurs in the limit X 0, 
leading to the systems displayed in formulas (19) and (20), 


II. QUASI FIRST INTEGRALS 
11,1. Definition 

Consider the system 

(1) * - f(0» x, y), y - g(a, x, y) , 

where x and f are N- vectors, where y and g are K-vectors , where o is 
a scalar parameter, and where the dot denotes differentiation with respect to 
the independent variable t. We also assume that f and g are of class C* 
in the region where solutions are considered. The system (1) is evidently of 
order N ♦ K. 
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Suppose that this system is partially decoupled when a takes on a 
specific value, say 0, in the sense that g(0, x, y) is independent of x. 
We write 


(2) 8(0, x, y) - G(y) , 

where G is a K-vector, Let y(t) denote any fixed solution of the 
"decoupled" system 

(3) y - G(y) 

and consider the "non-homogeneous variational equations" 

(4) Y « G y [y(t)]Y ♦ g o (0, x, y(t)) , 

where Y is a K-vector, where G^ is the Jacobian matrix of the components 
of G with respect to the components of y, and where g^ is the partial 
derivative of g with respect to a. 


A function F(t, x, Y) is_ said to be a_ quasi- first integral of the system 

(5) x - f(0, x, y ( t) ) 

of order N, if_ vt is_ JLH ordinary first integral of the (N ♦ k)th_ order 
system consisting of (4) and (5) . 

In other words, if x(t) is any solution of (5) and if the pair 
[x(t), Y(t)] satisfies (4), then 

(6) F[t, x(t), Y (t) ] = constant. 


Since (S) does not involve Y and since (4) is linear in Y, it is easy to 
express Y by quadratures in terms of x. In fact, if G(t) is a fundamental 
matrix solution of the homogeneous variational equations, 


a - G y [y(t)]n , 

then any solution of (4) must assume the* form 


Y(t) 


n(t) [c ♦ / Ji(s) 

o 


8 o (0, x(s), 


y(s)) ds] , 


where c is a suitably chosen constant k-vector. Conversely, if c is an 
arbitrary constant K-vector, the Y(t) given by the last formula satisfies (4). 
Hence inserting this expression for Y(t) into (6), we see that our definition 
of a quasi-first integral of (5) implies that 
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i 

F(t, x(t), a(t)(c ♦ / n(s)' g (0, x(s), y(s))ds)] = constant , 

0 

no matter how the constant K-vector c nay be chosen. 

We first wish to discuss this definition with respect to the system (21) 
and (22) of the previous section. These form a system of 5n equations, each 
of the second order, in the components of the r’s. However, we have taken the 
origin at the center of gravity of n^, m k , so that we have three known 

linear relationships among the unknown vectors r^ . .., r^. These relation- 
ships render three of the equations (21) redundant. Thus the system (21) and 
(22) provide only 3n - 3 independent equations of the second order, 3k - 3 
from (21) and 3(n - k) from (22). These second order equations may each 
be replaced by two first order equations by the familiar devise of using nhase 
space instead of configuration space. Equations (21) are therefore equivalent 
to a set of N » 6k - 6 first order equations and equations (22) are 
equivalent to a set of K * 6(n - k) first order equations. It is with these 
understandings that we consider the system consisting of equations (21) and (22) 
of the preceding section as an example of the system (1) of the present section. 

We take s * o" 1 , so that as a 0, the system becomes partially decoupled as 
already explained. 

The full system of N ♦ K * 6n * 6 equations admits four well known first 
integrals corresponding to the energy and the three components of angular 
momentum. (The other six integrals of the n body problem, namely those 
corresponding to the linear momentum, are not available because of the choice of 
our non-inertial coordinate system. They would be used for obtaining positions 
with respect to an inertial system in terms of unknown r’s). 

The quasi-first integrals were invented to investigate what happens to 
these first integrals in the partially decoupled system. Briefly as a ■+ 0, 
all four of these first integrals reduce to first integrals of the system (24) 
of the previous section; while the system (23), considered for a fixed solution 
of (24) in which r Q (t) are regarded as known for a * k ♦ 1, ..., n, has, in 

general, no first integrals whatever. It does turn out, however, to possess 
four quasi-first integrals in a manner described in a more general setting below. 


II. 2. Theorems on Quasi-First Integrals 


In the sequel we suppose that the system (1) admits 
h(o, x # y) and that h(0, *» y) * s independent of x. 


a first integral 
Let 


(7) 


h(0, x, y) * H(y) . 
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We also assume that both h and 11 are of class C M , 

Theorem 1 . If y(t) is any fixed solution of (3), then the function 

(8) l ; (t # x ( Y) - h o (0, x # y(t) ) ♦ n y (y(t))Y 
is a quasi-first integral of the system (5), 

Proof . Since h(a, x, y) is a first integral of (1), we have 

h x (a, x, y) f(o, x, y) ♦ h y (o , x, y) g(°, x, y) = 0 

Differentiate this with respect to o and then set o *• 0. Remembering (2) 
and ( 7) , we find in this way that 

(9) h ox (°» y) f C°. x » y) ♦ h oy c °' x » y) G ( y ) * M y(y) g a (°» x » y) = 0 * 

since h^CO, x, y) « 0 by (7). Also the system (3) admits the function H(y) 
as a first integral. Thus, we have the identity 

H y (y) C(y) S 0 . 

Differentiate this with respect to y and then set y » y(t). We thus get 

H yy (y(t)) G( y(t)) ♦ H y (y ( t) ) t; y (y(t)) = 0 

l-'orm the inner product of the left member of this last identity with the vector 
Y, We thus obtain 

CIO) Y H (y(t)) C(y(t)) ♦ H y (y(t)) G y (y(t))Y = 0 , 

or more briefly 

(11) [11 C * II C ]Y = 0 , 

^ J 1 yy y y J 

Now evidently, from (8) and (3), we have 

(12) §£ - [ h oy (U, X, y(t)) ♦ H yy (y(t))Y] G(y(t)) 

(13) f(0, x, y(t)) - h ox (0, x, y(t)) f(0, x, y(t)) 

(14) (G v (y)Y ♦ R o (0, X. y(t) ) ) - ll y (y(t)) «; y (y)Y ♦ g o (0. X, y)} . 
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We now add (12), (13), and (14). On the right hand side the underlined terms 
cancel out because of (9). The other terms cancel out because of (11) and the 
fact that H GY * H YG, the Hessian matrix H being symmetric. Thus we 
find that yy yy yy 


3F 

at 


1 I * 

ax * 


(o, 


x, y(t)) 


{G y (y(t))Y ♦ g(j (0, X, y(t})} 


o . 


Hence F is a first integral of the system composed of (4) and (5), and hence, 
by the definition, it is a quasi-first integral of the system (5), as we set out 
to prove. 

In the previous subsection we indicated how the Y can, in general, be 
eliminated from a quasi- first integral by use of quadratures acting on the 
unknown x, The method involved the knowledge of a fundamental matrix solution 
of the homogeneous variational equations. In the particular instance treated in 
Theorem 1, where Y occurs only in the combination H (y(t))Y as in (8), this 

elimination is much easier. The essential facts are displayed as follows: 


Theorem 2. If y(t) is any fixed solution of the system 


(15) y " G(y) , 

which has the first integral H(y), and if Y (t) satisfies the non-homogeneous 
variational equations, 

(16) Y - G y (y(t))Y ♦ K(t) , 
then 


t 


(17) H (y(t))Y(t) - / II (y(*))K(s)ds ♦ H (y(t ))Y(t ) . 

y t Q 

In particular H y (y(t))Y is a first integral of the homogeneous variational 
equations (cf. special case K(t) = 0). 

Proof , Since H(y) is a first integral of (15), we know that H y (y)G(y) = 0 

is an identity in y. Differentiating with respect to y and then setting 
y * y(t) , we obtain the following identity in t: 


Hy y (y(t)) G(y(t)) * Uy(y(t)) Gy(y(t)) = o . 

Forming the inner product of the left side of this identity with the vector 
Y (t) we obtain 
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(18) H yy (y(t)) G(y (t) ) Y(t) ♦ H y (y(t)) G y (y(t)) Y(t) = 0 . 

On the other hand 

[H y (y(t)) Y(t) ] • H (y(t))y(t)Y(t) * ll y (y(t))Y(t) . 

Hence, from (15) and (16), we find that 

[H y (y(t))Y(t)] - ll yy (y(t))G(y(t))Y(t) ♦ H y (yCt))G y (y(t))Y(t) ♦ H y (y(t))K(t) . 

But the first two terms on the right of this equation cancel because of (18). 
What is left is equivalent to the stated theorem. 

Theorem 3 . Under the hypotheses of Theorem 1, let x(t) be an arbitrary 
solution of (5). Then 


h a (0, x(t), y(t) ) + / H y (y(s))g o (0, x(s), y(s))ds ■ constant. 

*0 

Proof . Since the F of formula (8) is a known quasi-first integral of (5), 
it is known at once that 

(19) h a ( 0 # x(t), y(t)) ♦ H y (y(t)) Y(t) ■ constant , 

where Y(t) is any vector satisfying (4), x being set equal to x(t). But we 
then know from Theorem 2, with K(t) * 8 a (° * x(t), y(t)), that 

t 

H (y(t))Y(t) * / H (y (s) ) g (0, x(s), y (s) ) ds + constant . 
y t k o 

Inserting into (19), we obtain the result to be proved. 


II. 3, Application to Reduced 3-Body Problem 

The detailed application of the theory of quasi-first integrals to the 
systems of the previous Section has not been carried out. Instead we present 
the following outline of how the application may be made to the planar reduced 
three body problem and in particular to the restricted problem, at least to the 
point of producing, in this example, the constants of motion referred to in 
Theorem 3. 

We consider the planar three body problem with three masses o, u, 1 - y. 
We use two frames of reference, one with origin at the center of mass of the 
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three bodies, and a parallel frame with origin at the body with mass u. We let 
, x 2 be the coordinates of a with respect to the first frame, and we let ' 

y l* y 2 be the coordinates 1 - u with respect to the second frame. Then it 

is elementary, though somewhat laborious, to show that the full planar three 
body problem may be reduced to the solution of the following system of 
differential equations: 

x - - p[(0 ♦ l)x ♦ (1 - u)y.]R“ 3 - (1 - m)[(o ♦ l)x - u yjr" 3 

( 20 ) 1 

i - 1, 2. 

y'i « - ♦ y 2 2 ]’ 3/2 ♦ at(o+l)x. - ay i ]r’ 3 - o[(o^l)x i ♦ (l-tOyjR* 3 

( 21 ) 

i - 1, 2. 


where, 

( 22 ) 


both in (20) and in (21), we have used the abbreviations, 
r 2 « [(a*l)x 1 • uyj] 2 ♦ [(a»l)x 2 - uy 2 ] 2 
R 2 * [(o+IJXj ♦ (l-uj/j] 2 ♦ [(o*l)x 2 * (l-u)y 2 ] 2 


The four second order equations displayed in (20) and (21) are equivalent 
to eight first order equations, which form the system to be identified in this 
example with the System (1). Thus N = K * 4 and # x and y are to be t # 
regarded as four vectors with components (x^, x 2# Xj, x 2 ) and (y 1# y 2 » y i» y 2^ 

respectively. The system has two first . integrals, 


h (1) * a(a+i)(x 1 x 2 - x^) * uU-uHy^ - y 1 y 2 ) 


and 




-72 2 

Vy *y 


Thus, we obtain 
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( 23 ) 


h a J x, y) =» x x x 2 - XjX 2 


(1) 
J 

h < 2 > 


V (0. X f y) . * (X 2 ♦ x/) - f ^ 

0 0 

where R Q and r^ are obtained from the expressions for R and r respect- 
ively by setting o * 0 in (22). 

^ ( y) - h^(0, x, y) - u(l-u)(yjy 2 - YyY 2 ) 

(24) 

l|(2) (>) * l> (2) (0. x. y) - i. u(l-u)(P, : ♦ yj) - aLtHL- 

J 2 2 

/y l * y 2 

The system (3) in this example is represented by (21) with o * 0, namely 


(2S) 


y i * - ^i^i 2 * y 2 2 ]‘ 3/2 . i * i, 2. 


and the system (5) is represented by 


(2o) x i . - u (x. ♦ O-uJypUy’ 3 - (l-uJtXj - uyjjry" 3 , i • i. 2, 

these being the equations for the reduced problem of three bodies when y ^ and 
y 2 arc thou 8 ht of as being known functions of t satisfying (25). The four- 
vector g (0 # x , y) is readily seen, in this example, to have the components, 


°* °* (x l ' My l )r o " 3 ' (x l * U-^ypR,, 3 . (x 2 -uy 2 )r 0 ' 3 - (x 2 »(l-u)y 2 )R 0 * 3 . 


According to Theorem 3, we should have two constants of motion, which we shall 
denote by and c 2 , given by the formula, 

c i * h o U)(0 » 4 / H v (l) (y(s)) n CO, x(s) , y (s) ) ds , i - 1, 2. 

t 7 0 

0 

Wo now have available all the data necessary for the use of this formula, namely 
the four components of * o (0, x, y) , and the expressions for h o (i) (0, x, y) 
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and H ”'(y) , given by (23) and (24) respectively. We thus obtain 

r * -3-3 

(27) Cj - XjX 2 - XjX 2 * u(l-u) / (x 2 y x - x^H^ - R Q )ds 


’ T^i 2 * J 2 2) ' - -TT 1 * ' J(1 -' l) ! C Vl * V2 )(r ( 


0 0 


o ' 3 - R o' 3 > ds 


(28) 


j(l-u) / (y^j ♦ y 2 y 2 )(u r Q “ ♦ (i-u)R 0 " 3 )ds 


where the symbols x i# 5^, y i y i# r Q , R Q , occurring outside the integral signs, 

represent functions of the independent variable t; but, when they occur 
inside the integral signs, they represent the corresponding functions of s, 
the variable of integration. 


In the special case of the restricted problem of^three bodies, in which we 
take y x Ct) * cos t and y 2 (t) * sin t, we have y^ + y 2 y 2 * 0, so that 

the second of the two integrals in (28) drops out completely. Moreover 
9 ’ y 2 and y 2 9 y V This means tiiat the first of the two integrals in 

(28) is the same as the only integral in (27). Thus we find that 


(29) <= 2 - <=! * 7 C^ 2 ♦ i 2 2 ) ' Cx 2 i 2 - JjX 2 ) - 


(30) c x • x 1 x 2 - x^ 2 + u(l-u) / (x 2 (s)cos s - x 1 (s)sin s) (r 0 “ 3 - R Q ‘ 3 )ds . 


Thus in this particular case, we have a first integral in the ordinary sense, 
namely the integral of Jacobi. In rotating coordinates (£^, 5 2 )» connected 

with the non-rotating coordinates by the transformation, 


( x i 


x 2 ) 


Om 


c 2 ) 


the equations (26) , in which y^ 


cos t and y 2 « sin t, become 
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*1 ' 2 ^2 


• 


dC^+l-u) (1-uH^-u) 


(31) 


C 2 * 2 tl 


C2 ' 7 s 

R 0 


ut 2 (l-y)C 2 


where Rg and have the same meaning as before but are now expressed in 

terms of (C^ • $ 2 ) instead of (> 1( x 2 ) , so that 


2 2 2 
K n - ♦ C/ 


and 


2 2 

( V p) * C 2 


These are the usual equations for the restricted problem of three bodies. 
The equation (29) is transformed into 


1 2 • 2. l fr 2 r 2. u (1-u) 

2 Ct l +C 2 5 ' 2^1 *^2 5 “ R q rJJ C 2 * 1 * 

which is the usual form of the integral of Jacobi. Finally the equation (30) 
is transformed into 


q 2 ♦ C 2 2 ♦ qt 2 - «J « 2 * li(l-li) / C 2 (s) [r 0 ‘ 3 - K 0 " 3 ]ds - Cj . 

t o 

The fact that the left member of this equation is a constant of the motion is 
an easy direct consequence of (31) . 


III. HRROR ESTIMATION 
111,1. The Hyperbolic Cosine Estimate 

We wish now to present a method for estimating the error in the computation 
of trajectories by using such decoupled systems, as those considered in Section 
I, instead of the exact systems. Actually the method applies to any kind of 
approximation, whether partial or complete decoupling occurs, or not. The 
problem is formulated as follows: 

Suppose we have two N-vector functions f(x) and g(x) of the N-vector 
x, defined and continuous in some region R of N-vector space. Suppose also 
that f and g are approximations to each other in the sense that the norm 
of their difference is bounded throughout R by some positive number 6. The 
smaller 6 is, the better the approximation. We thus assume that in R 
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(D ||f(x) - S(x) 1 1 <6. 

The norm ||...|| referred to may be chosen arbitrarily, so long as it has the 
usual properties of a norm in N- dimensional vector space. 

Let x(t) e R and y(t) c R be solutions of the systems 

d 2 x 

( 2 ) —r ■ f(x) 
dt 

and 

(3) = g(y) 
dt" 

assuming the same initial conditions 

(4) x(0) = y (0) * a , say, and x(C) - y(0) = b , 

and defined on the interval 0 < t < T, The problem is to find as small an 
upper bound as possible for ||x(t) - y(t) | | . 

In the application to the n-body problem, N ~ 3n , and the region R is 
a region of configuration space in which the distance between each pair of the 
bodies exceeds a positive number assigned to such pair. 

We could write the systems (2) and (3) as systems of first order equations 
by doubling the dimensionality of the space, and then we could read off from 
the classical literature estimates of the required type, at least, if one of 
the vector functions f or g satisfies a Lipschitz condition, 

( 5 ) I I f(x') - f(x) || 4 U | | x* - x| | . 

We prefer, however, to adapt the classical methods directly to the systems of 
second order equations in order to get better results, (cf. L. Kamkc, 
bifferentialgleichungcn , Losungsmethoden , und Ldsungen. Chelsea Publishing 
Company, New York, 1948, pp, 40-41,) We shall indeed assume that (5) holds as 
long as x and x’ are both in R, and, for ease in later formulations, we 
introduce the following definition. 

An N-vector function f defined in a region R of N’-dimensional vector 
space and satisfying the Lipschitz condition (5) with Lipschitz constant B is 
said to satisfy the extension Hypot h esis relative to R and R t if for everv 
positive number A > B it is possible to define an N-vector function f 

A 

throughout the whole of N-dimensional vector space such that 
< 6 > f A (x) = f(x) 
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fo r all xcR and 

(7) 


||f A (x') - f A (x) I I < A | | x' - X | | 


for any two N-vectors x and x' , whether in K or not. 

Theorem 1, If f satisfies the Extension Hypothesis, relative to R and B, 

and if (1), (2), (3), (4) are also assumed, then 


( 8 ) 


| |x(t) - y(t) | | <, 6B" 1 (cosh (B 2 t) - 1) for 0 < t < T. 


Proof. x(t) and v(t) satisfy respectively the systems of integral equations 

t 

(9) x(t) - a * bt + / f(x(s)) (t-s)ds 

0 

and 

t 

(10) y (t) « a ♦ bt ♦ / g(y(s)) (t-s)ds 

0 

which under the initial conditions (4) are equivalent to the differential 
systems (2) and (3). Let 

x Q (t) » y(t ) for 0 < t < T 


(H) 

t 

x fc (t) » a ♦ bt ♦ / f A ( x k .i(s))(t-s)ds, k • 1, 2, ... . 

We note that all these successive approximations Xj(t), x 2 (t), ... must exist 

for 0 _< t £ T even though some or all of them do not stay within the region 

R. For the function f. is defined and continuous over the whole N-vector 
A 


space . 

It is easy to prove 

that 



(12) 

lira 

\ (t) ■ 

= x(t) 

uniformly on [0, T] , 


In fact , 

, from (11) and (7), 

we find that 



H x k*i Ct) - \ (t) 

II < Aj 

t 

f iix k (t) - x k-1 (t)ii 

(t-s)ds 
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If we now let u = max Hx^t) - x 0 (t)|| over the interval [0, T], we may 
now prove easily by induction that 

ll* kn (t) - XijCOll 4 A k u t 2k /(2k)l for t e [0, T] . 

It follows from the Weierstrass test that the sequence (x k (t)) converges 
uniformly to some vector function x*(t) , and, since, from (7) , 

H f A Cx k (t)) ' **<**(*» I I - A H X k (t) ' **(t)||, 
it is also obvious that 

ki!! f A^ X k^^ “ f> A ( x *C t )) uniformly on [0, T] . 

Hence passing to the limit as k », we see from (11) that 

t 

x *(t) ■ a ♦ bt ♦ J f A (x* (s)) (t-s)ds . 

0 A 

This means that x*(t) satisfies the system x - f A (x) and the initial 

conditions (4). But x(t), because of (6), also satisfies the sane conditions. 
Because of the uniqueness theorems covering such solutions, it follows that 
x*(t) = x(t) . Thus (12) has now been established. 

From (10) and (11) we are enabled to write 

t t 

y(t) - x k (t) - / g(y(s)) (t-s)ds - / f (x . (s) ) (t-s) ds . 

0 0 

whence, using the fact that y(s) e R so that f A (y(s)) * f(y(s)) by (6), we 
find that 

, 1 t 

y(t) - x k (t) * (g(y(s)) - f (y (s)) ) (t-s)ds ♦ / (f A (y(s)) - f A (x k-1 (s)))(t-s)ds. 

Using (1) and (7), we now find that 

| |y(t) - x (t) 1 1 < (6/2)t 2 ♦ A / | | y £ s) - x. (s) | | (t-s)ds. 

0 
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From this we prove routinely by induction that 


I |y(t) - x k (t ) \\ <j 


k 

l 

p-1 


A*V P 


Hence allowing k -> ®, we find, from (12), that 


||y(t) - x(t) | | 4 | 


z 

p-1 


P*2p 


A 1 t 


6 A 


(cosh(A 2 t) - 1) . 


Since this is true for all A > B, we find easily that (8) must hold as stated 
by the theorem. 


Theorem 2 . The inequality (8) can not be improved under the hypotheses of 
Theorem 1. 


Proof . We take the example in which N ■ 1, ||x|| « |x|, f(x) - Bx, 

g(x) ■ Bx + 6 , and R is the set of all real numbers. Thus if (2), (3), and 

(4) are to be satisfied, we find at once that the difference y(t) - x(t) * w(t), 

say, must satisfy w ■ Bw ♦ 6 together with the initial conditions 

w(0) ■ w(0) ■ 0. Integrating, we find that 

i 

y(t) - x(t) * w(t) * 6B 1 (cosh (B 2 t) - 1). 

so that it is possible in particular examples for the equality sign in (8) to 
hold. 

Theorem 3, Under the hypotheses of Theorem 1, we also have 

I 1 

(13) ||i(t) - y(t)|| < 6B 2 sinh(B 7 t). 


Proof, Subtracting (10) from (9) and differentiating, we see that 


i(t) - y(t) - / (f(x(s)) - g(y(s)))ds 

0 


t t 

- / (f(x(s)) - f (y (s) ) ) ds ♦ / (f (y(s) ) - g(y(s)))ds 

0 0 


Hence, from (5) and (1), we obtain 

t 

l|x(t) - y(t) | | < B / I |x(s) - y(s) | Ids + 5t . 
0 
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We now use Theorem 1 to obtain 

. . t 1 i 1_ 

I I x(t) - y(t) | | < 6 J (cosh (B 2 s) - l)ds ♦ St = SB~ 2 sinh B 2 t 
0 

as we wished to prove. 


111,2. A Useful Lemma 

In the application of the preceding subsection to the equations of motion 
for the n-body problem it is necessary, first, to choose a region U in which 
a suitably chosen Lipschitz condition will hold, and, secondly to verify the 
Extension Hypothesis, introduced in Theorem 1 , For both of these nurposes the 
following Lemma is useful, 

Lemma 1 * Let = r " 1 if r >, 1 ; but, for 0 <. r <. 1 , let 


♦kW 


(6k+3) (2k+3) (2k+l) (2k+3) 2 3(2k+3) 2k 

8 (k + l)k s¥(k-l) r + 8 k (k- 1 ) r 


3 (2k+l) _2k+2 

8 (k+l) k r 


where k> 1 , 


Then these definitions agree at r « I, Moreover 


c C ,M for 0 <_ r < 


and 


U k (P) Cr)| < p ♦ O p Ck), 


where 


lim 


Op(k) 


0 , 


P = 1, 2. 


Proof, To show that e C'" it is only necessary to compute the right handed 

and left handed derivatives of orders 1 , 2 , 3 at the one point r * 1 where 
\ obviously fails to be analytic. It will appears that each left handed 

derivative at this point is equal to the corresponding right handed derivative. 
Moreover these right and left derivatives are also the limits of derivatives 
taken on the right and left respectively. The details arc elementary and are 
omitted. We record, however, that for 0 4 r < 1. 


*1 fvi - - l 2 -k+l) (2k> 3) _ ^ 3(2k+3) J>k-1 3(2k+l) _2k*l 

* 4k(k-l) r 4(k-li 4k — r 

r-'-'i . _ X2_k*l) (2k-3) . 3(2k+3) ( 2 k-l) „2k-2 3(2k*l ) 2 2k 

4k(k-ll 4(irn r * r 

*••• Cr) - 3(4k 2 -4k-3) 2k-3 3(4k 2 *4k + l) 2 k-l 

Y k v * 2 2 
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Since <j>, (r) and its derivatives are, for r ^ 1 , simple monotonic 
functions of r which tend to 0 as r *► «, the problem of estimating the 
maximum moduli of the first two derivatives of $ k (r) ma Y be reduced to a 
study of what happens on the interval (0, 1]» 


Since $"’ k (l) « -6, since <j>'" k (r) has a simple zero at 
2 . 1/2 

r ■ r « ( 4K ^ .... ) t and since it vanishes nowhere else on the open interval 
k 4k^+4k+ 1 

(0, 1), it follows that <fr n k (r) assumes its maximum at r * r k> But a 
straight forward calcultion based on the formula for 4>" k (r) shows that 

.. , % (2k + 1) (2k+3) ,12k 2 + 12k + 3 w 4k 2 +4k-3, k 

♦VV - - Wit— * 1 

From this, we obtain by routine methods that 


lim 

k-*» 


♦VV 


2 


Since ds", (0) = -1 and A", (r) is monotonic increasing on [0, r, ] and 

k-*“ k k K 

monotonic decreasing on [r k , ®) , it follows that 


|4" k (r)| < 2 ♦ 0 2 (k), 0 2 (k) - 0 . 

which is one of the two desired estimates. To get the other estimate we write 
the above formula for $' k (r) in the form 


(13) 4>' k (r) “ r b k ( r) 

where 

, (2k+ 1) (2k + 3) . 3(2k+3) 2k-2 3(2k+l) 2k 

V r) 4k(M 4(ii-p r 4k r • 

It is readily found that h' k (r) is positive throughout the open interval 

(0, 1), Hence 

- ■ V°> i h k (r) - h k cl) ’ ♦V» ’ - 1 
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for all r on the closed interval [ 0 , 1 ], Hence multiplying by r and using 
(13) we obtain, for r e [0, 1], the following inequalities 


(2k+l) (2k+3) 
4k(k-l) 


(2k*l) (2k+3) 
4k (k- 1 ) 


r < rh R (r) 


It follows that |$'j.(r)| 4. c 1 

required estimate. 


a <t>* ^Cr) 4 - r < 0, 

+ 0 9 (k) # which is the other 


111,3, Lipschitz Constants for Non-Convex Regions 

If f is an N-vector function of class C* defined over a convex region 
R in the N-dimensional vector space of the vectors x Rnd if the partial 

derivatives of the components of f with respect to the components of x are 
bounded in R, then one can always find a Lipschitz constant B, which is 
related in a simple way to bounds on certain expressions involving these 
partial derivatives, such that (5) holds for any two points x' and x in R, 
For instance if P is the least common upper bound for the absolute values of 
all of these partial derivatives, one may take B * NP, at least if | jxj | 


denotes the "Euclidean" norm 


C * 

i* 1 


or the nameless norm 


E 

i = 1 


|xj 


O 2 


or the "uniform" norm 


max (j 
l<i<N 




If the region R is not convex the situation is more complicated. One way 
of dealing with it is to choose a convex region R* which contains R and seek 
a C* extension f* of f over the whole of R* in which the bounds on these 

derivatives over R* exceed the bounds over R by certain increments which 

are regarded as permissible. These permissible increments in some cases, which 
we term regular (but by no means in all cases), may be taken arbitrarily small. 
From these bounds over R* ( we then obtain a Lipschitz constant 13* such that 

(14) ||f*(x') - f»(x)|| 4 B* | j x* - x | | 

as long as x and x' are in R* and hence a fortiori as long as they are in 
R where f* * f. 

In the regular case , just defined, the B* would exceed by an arbitrarily 
small amount tne B obtained in the same way from the bounds of the partial 

derivatives over R but ignoring the non-convexity of R where f* * f. In 

other words if e is any positive number we could (in the regular case) choose 
f* in such a way that the B* in (14) is not greater than B + e; so that, 
remembering that f* is always the same as f in R, we obtain, for any two 
points x’ and x in R, the following inequality. 
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| | f(x') - f(x) | | 4 (B ♦ c) | | x’ * x| | . 

Hence, allowing e to tend to zero, we obtain (5). Thus in the regular case, 
the Lipschitz constant may be regarded as related to the bounds for the partial 
derivatives exactly as they should be if R were convex. 


III. 4, Application to the n-Body Problem 

In the problem of n bodies, the N * 3n components of the vector x 
are denoted by (x^ y^ z 1# x 2 , y 2 , z 2# x n , y n , z n ) , where x. , y., and 

are the coordinates of the mass point nu. The 3n components of f may 

be written in the form 


l au 

m. 3x. ’ 


l l 


l au i_ au 

m. 3y, * m. 3z. * 

l * l li 


where 


i - 1, 2, 


n 


U 



and 


r. . 


( (x. -x . ) 

i y 


‘W 


Cz.-z .) 2 ) 1/2 

i y 


A typical closed region R to be considered is one which excludes all 
points for which one or more of the r^ are zero, say the set of all points 

for which r. . > a , . ; i, j»l, 2, i^j. Here o . , ■ a . , are 

i) - ij * J * ' * ’ J i j ji 


(l/2)n(n-l) assigned positive numbers. This region is evidently not convex; 
but we nevertheless are in the situation of the regular case mentioned above. 

In fact we can take R* to be the whole 3n-dimensional space and obtain a C' 
extension of f over the whole of R* by choosing a positive integer k > 1 
and taking the 3n components of the extension f* to be 


3 V, 


m. 3x. 
l i 


i_^k 

m. 9y. 


L .^1 

m. 3z. 


i » 1, 2, 


n. 


where 


l 

i<j 


-1 A , -1 , 
m . o.. (a . . r. . ) 
j g y k v ij 


Then from the lemma of SubSection 1 1 1. 2 one finds that bounds for the 
partial derivatives of the f * over the whole of R* exceed the bounds for 
corresponding derivatives of f over R by arbitrarily small increments 
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provided that k is taken sufficiently large. 

It is evident also that this same substitution of for u suffices 

to establish the Extension Hypothesis of Theorem 1, Sub-Section 111,1, 

For simplicity, we have considered here only the unreduced form of the 
equations for the n-body problem; but similar considerations would apply 
equally well also to such other forms as the helio-centric equations, the 
barycentric chain forms, or the various reduced equations considered in Section 
I. 
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SUMMARY 


The perturbation problem for a system of non-linear differential 
equations under general linear (but not necessarily homogenous) two point 
boundary conditions is reduced to the problem of solving k "bifurcation" 
equations, where k is the degeneracy of the problem. In the non-degenerate 
case k = 0, the set of such equations is vacuous and the problem is 
automatically solved. If k ji 0 and if there are k independent first 
integrals, a significant transformation of the bifurcation equations in terms 
of these first integrals is carried out. The largest section of the paper is 
concerned with the construction of generalized Green’s matrices (and with 
kindred matters) for linear systems with boundary conditions of arbitrary 
degeneracy, The generalized Green's matrix for the variational equations is 
prerequisite to our treatment of the non-linear problem. 


INTRODUCTION 

We extend here some results previously obtained for periodic boundary 
conditions, C£ , Annals of Mathematics, volume 63 (1956) pp. 535-548, Our 
original purpose was to develop general methods for the perturbation of 
periodic solutions of the n-body problem satisfying certain added conditions 
of symmetry. For this purpose the problem frequently could be formulated in 
terms of two point linear homogenous boundary conditions. It then became 
evident that the same methods might work also for problems of entirely 
different nature. Hence we devised the formulation described in Section I, 
where the boundary conditions are linear but not necessarily homogeneous. It 
is probable that much could be done for non-linear boundary conditions, but 
our historical introduction to the problem was such that questions of this 
sort arose too late to be considered here. There is obviously much still to 
be done in this field. With this in view we have carried out the developments 
of Section II on linear systems to a rather more complete state than strictly 
necessary for the primary purposes of this paper alone. 
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I * FORMULATION OF THE PROBLEM 


We are concerned with the problem of solving a system of differential 
equations under general linear (but not necessarily homogeneous) two point 
boundary conditions. More precisely, let the differential system be written in 
the form, 

dC/dt - F(t, £, u) , 

where t is the (scalar) independent variable, £ is an n-vector the 
components of which are the unknown functions, and p is a scalar parameter. 

F is an n-vector function defined and of class C* for 0 < t < T, as long 
as p and C belong to suitable domains. We seek functions £(t) satisfying 
the above system as well as the following system of boundary conditions: 

B 0 UO) ♦B T ((T)-p, 

where p is a given n-vector and both and are given n * n matrices. 

This system of boundary conditions is equivalent to n scalar conditions, which, 
if they are independent, are presumably just right to determine the n constants 
of integration for our n t ^ order system of differential equations. To insure 
the independence of our boundary conditions, we assume that the rank of the 
n x 2n matrix (B^, B^) is equal to n. 

The perturbation problem, to which we devote attention, assumes that for a 
particular value of p , which without loss of generality we may evidently take 
to be 0, we have a known solution, say ;(t), Our problem is to determine a 
solution for p i 0, say £(t, p) , such that lim £(t, p) ■ C(t). 

p-K3 

We obtain a more convenient formulation of our problem by introducing the 
following notations: 

x ( t ) - C(t) - C(t) 

A(t) - F ^ ( t , ;(t), 0) 

f(t, x, U) - F ( t , C(t) ♦ x, p) - F ( t , c(t, 0) - A(t) x , 

where F^ denotes the jacobian matrix of the components of F with respect to 

the components of £, Since £(t) is regarded as known, so is the matrix A(t). 
And evidently f(t, X, p) is of class C* and vanishes together with its 
partial derivatives with respect to the components of the vector x, when 
p * 0 and x ■ 0, 

Since £(t) is supposed to be a solution of the two point boundary problem 
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when u ■ 0 ( we have 

dC(t)/dt - F(t, c(t), 0) 


and 


B 0 c(0) + B t ; (T) - p . 

It is now clear that £ will be a solution of the original two point 
boundary problem if and only if x satisfies the system 

dx/dt ■ A(t)x ♦ f(t, x, u) 

and the homogeneous linear boundary conditions 

Bq x(0) ♦ B t x(T) - 0 . 

Our perturbation problem can now be formulated in terms of finding a solution 
x ■ x(t, y) of the last written differential sytem and the last written 
boundary conditions such that lim x(t f u) ■ 0. 

y*K) 

It is relatively easy to prove, for |y| sufficiently small, the existence 
of such solutions provided that the so-called variational system 

dx/dt * A(t) x 

has no solutions satisfying the above homogeneous boundary conditions other than 
the "trivial" solution x = 0, This is the so-called non-degenerate case. In 
this case the solution may always be found by a convergent process of 
successive approximations. 

Many of the interesting problems in celestial mechanics are not, however, 
of this non-degenerate type. The problem is said to have degeneracy k, if the 
variational system has k linearly independent solutions satisfying the above 
homogeneous boundary conditions (upon which all other such solutions are linear- 
ly dependent). If k > 0, the original problem may have no solution; but, if 
it has, it may be found by a method of successive approximations followed by a 
solution of a system of k so-called "bifurcation" equations. The way in which 
this is done is explained in Section III, In the non-degenerate case k = 0, 
there are, of course, no bifurcation equations to solve. 

To prepare the ground for the treatment in Section III, we must present in 
the next Section a considerable theory of linear systems. The non-degenerate 
case is included because all considerations are valid when k * 0. In fact most 
of the difficulties completely collapse when k * 0, many statements and 
conditions becoming vacuous. 

The linear theory of Section II has close relationship with previous work 
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by IV. T. P.eid on generalized Green's matrices for compatible systems of 
differential equations. (American Journal of Mathematics, volume 53 (1931), 
pages 443-459). In this reference there are references to still earlier work 
by G. A. Bliss and others. It would probably be possible to derive all the 
results we need by citing various theorems presented by these earlier authors. 
It will be easier, however, to give independent proofs; and this will have the 
advantage of developing some additional facts, which we shall also need. 


II, PRELIMINARIES ON LINEAR SYSTEMS 


IVe wish to consider continuous n-vcctor functions x(t) * [xj(t), 

( t ) J defined on the interval [0, T] and satisfying n independent linear 

homogeneous end conditions of the form, 

CD B () x(0) ♦ B t x(T) - 0 , 

where and By are given constant n x n matrices. 

A vector x(t) which satisfies (1) will be called admissible . Otherwise 
it will be called inadmissible . 

The condition (l)can also be written in the form 


(2) 



0 


where (B , By) is, of course, the n x2n-matrix formed by the indicated 
juxtaposition of B ( ^ anti B and likewise 

/ x(0)\ 

V x(T v 

is a 2n x l matrix formed by the juxtaposition of (column) vectors x(0) and 
x(T). Since the n conditions given by (1) or (2) are to be linearly 
independent, it is seen from (2) that it is both necessary and sufficient to 
assume the rank of (B^, By) to he n, as we henceforth do. This means that 

(Eq, By) contains a non-sigular n x n matrix C. The matrix (B Q| By) may 

then be modified by permuting its columns so that it appears in the form 
(C, D). Then evidently 
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where I is the n * n identity matrix. We thus find that there exist n x n- 
mat rices Up and U^, such that 


(3) 



- I . 


In fact, we may obtain 



by performing on the rows of 



the same 


permutation as we apply to the columns of (C, D) in order to recover 
(Up, B^,) , We also introduce the matrix U(t) defined by 

(4) U(t) - (1 - tT' 1 )U Q ♦ tT* 1 U T , 

so that U(0) ® U and U(T) ■ U . Ihe following lemma then follows at once 
from (3), 

Lemma 1 . If (Bp, B ) is of rank n, there exists an n x n-matrix U(t) 
whose elements are linear functions of t such that 

(5) BpU(O) ♦ B t U(T) - I . 


Incidentally, although the columns of U(t) are all continuous n-vector 
functions of t, they are all inadmissible in the sense of the above definition 
of admissibility. 


Theorem 1 , Consider the linear differential system, 

(6) dx/dt * A(t)x ♦ f(t) , 


where x and f are n-vectors and A is an n x n-matrix, A and f are 
known continuous functions of t, defined for 0 ^ t ^ T. Let X(t) be any 
n x n-matrix such that dX/dt ■ A(t)X and det X(0) + 0 (and hence also 
det X(t) i 0 for any t on [0, T]). Let Bp and be given constant 

n x n-matrices such that the rank of (B Q , B^) is n. Let n - k denote the 
rank of the n x n-matrix BpX(O) ♦ B X(T), so that k is the number of linear- 
ly independent solutions of the homogeneous system corresponding to (6) which 
are admissible in the sense (as defined above) that they satisfy the boundary 
condition BpX(O) ♦ B^x(T) ■ 0. 

Then there exist (independently of f) a k x n-matrix function 2(s) and 
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an n x n-matrix G(t, s) , both continuous, except that G(t, s) posseses a 
finite jimp at t ■ s, having the following three properties: 

I. The system (6) possesses an admissible solution (i, e,, a solution 
x(t) such that B q x( 0) ♦ B^,x(T) - 0), if, and only if, 

T 

(7) / H(s) f(s)ds ■ 0 . 

0 

II. If (7) is satisfied, the vector function 

T 

(8) x(t) - / G(t, s) f(s)ds 

0 

is a solution of (6) and is, moreover, the only admissible solution orthogonal 
to every admissible solution of the corresponding homogeneous system. 

III. Whether (7) is satisfied or not, x(t) , defined by (8) is admissible. 

Proof . Since n - k is the rank of B (1 X(0) + B^X(T) , there is a k * n- 
matrix (Jt and an n * k-matrix both of rank k such that 

00 OL[B 0 X(0) ♦ b t X(T)J - 0 

and 

CIO) [B Q X(0) ♦ U t X(T) ]» ■ 0. 

Here, of course, 0 < k < n, The general solution of (6), whether admissible 
or not, is well known to be of the form 

t , 

(11) x(t) - X(t)B ♦ / X(t) X(s) " A f(s)ds . 

0 

where the constants of integration are the components of the n-vector 0. The 
attempt to find admissible solutions of (0) leads to the equation 
B q x(0) ♦ B t x(T) - 0, or 

T , 

(12) [B o X(0) ♦ H t X(T)JS ♦ / ll T X(T)X(s)‘ 1 f(s)ds - 0 , 

for the determination of 0. On account of (3) » this system of linear equations 
for the determination of B is consistent if, and only if, 
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(13) / 0tB X(T)X(s)" 1 f(s)ds = 0 , 

0 

which can be written in the form (7), if we let 

(14) =(s) . (Hb^CTJXIs)" 1 . 

If (13), or (7), is satisfied, it is possible to choose 3 in infinitely many 
ways so as to satisfy (12). Moreover, since the second term in (12) is the 
integral of a known matrix function of s multiplied into the vector f(s), we 

see that 0 may be chosen to be given by a similar expression; say 

T 

B « / P(s)f(s)ds 

0 

where P(s) is a suitably chosen (not unique) continuous n x n-matrix functin 
of s, independent of f. Substituting in (11) we have 

T 

(15) x(t) * / K(t, s)f(s)ds , 

0 

where K(t, s) - X(t) [P(s) * X(s) *] for 0^ s < t and K(t, s) - X(t)P(s) 

for t < s < T, The x(t) given by (IS) always satisfies (6) but is admissible 

if, and only if, (7) is satisfied. 

We now wish to find another kernel matrix K(t, s), such that, upon 
writing 

T _ 

( 16 ) X(t) - / K(t, s)f(s)ds , 

0 

we can say that the x(t) given by (16) is always admissible and satisfies (6) 
if (7) is satisfied. Since x(t), as given by (15) is admissible if, and only 
if, (7) is satisfied, it is clear that (7) is equivalent to the condition 

T 

( 17 ) / (B 0 K(0, s) ♦ B t K(T, s)) f(s)ds - 0 . 

Hence, if we set 

K(t, s) - - U(t)[B 0 K(U, s) t B t K(T, s)] ♦ K (t , s) , 

so that 
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1 ' 

C 17*) x(t) - - U(t) / [B Q K(0, s) ♦ B t K(T, s)] f(s)ds ♦ x(t) , 

then XU) coincides with x(t) whenever (17), or its equivalent (7), is 
satisfied. This is true for any n * n-matrix function U(t) , but the 
following statement depends upon choosing U(t) in accordance with Lemma 1. 

I rom ( 1 7*) , wc have 

T 

B 0 x(0) + B. r x(T) * - [Byli(O) * B t U(T)] / [B Q K((>, s) ♦ B T K(T, s)]f(s)ds 


♦ BqX(O) + B t x(T) , 

which, in virtue of (15) and the fact that [B () U(0) + B T U(T)] « I by Lemma 1, 

must vanish. Thus x(t) , as defined by (16), must always be admissible, _ 
whether (7) is satisfied or not, as we wished to prove. And, of course, x(t) 
also satisfies (6) if (7) is fulfilled , because we already know that if (7) is 
fulfilled x(t) coincides witK x(t) , which always satisfies (6). 

Next we wish to find a kernel matrix 6(t, s) such that the x(t) given 
by (S) satisfies the same properties as have already been specified for x(t) 
as given by (lb) plus the additional property that x(t) is always to be 
orthogonal to every admissible solution of dx/dt * A(t)x. 

Assuming that (7) is satisfied, every admissible solution of (6) can 
evidently be written in the form 


(18) x(t) « X(t)j6 c + / k(t, s)f(s)ds , 

0 

where c* is a suitably chosen k- vector. This follows from the fact that the 
columns of X(t * form a complete set of admissible solutions of the homo- 
geneous equations because of (10) and the fact that the rank of B 0 X(0) + B^XtT) 

is n - k, Lven when (7) is not satisfied, both terms on the right of (18) are 
known to be admissible and hence x(t) as given by (18) is always admissible, 
Wc now show that it is always possible to choose c so_ that 

T 

(19) / (X(t)JB)’ x (t) dt - 0 , 

0 

in other words , so that the x(t) given by (18) is_ orthogonal to all the 
admissible solutions of the homogeneous equations and that this is true 
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regardless of whether (7) holds_. From (18) and (19) we see that it is 
necessary and sufficient for c to satisfy 

T _ T T _ 

/ & X(t)'X(t)gc dt ♦ / fc' X(t) 1 / K(t, s)f(s)ds dt - 0 

0 O'" 0 

or 

T - T T „ 

(20) {/ Jg X(t)'X(t)gdtl c - -/ ft'(J X(t)'K(t, s)dt)f(s)ds . 

0 0 0 

We prove that the k * k matrix within the brackets { } is non-singular by 
showing in the following way that it is positive definite. Let q be any k- 

vector. Since X(t)* 1 exists, J£q = 0 if, and only if, X(t)j£q = 0. Since 
Jo is of rank k ,_ this is possible only i_f q ■ 0, The quadratic form 
q(X(tjS;) ' (X(t^J )q in the components of q is the sum of the squares of the 
components of X(t)ftq and is, therefore, positive definite (in the real 
field). Hence 

T _ _ _ T 

/ <lJB’X(t)'X(t)Bq dt - q (/ JB'X(t)'X(t)ft dt) q 
0 0 

i_s also positive definite, considered as a quadratic form in the components of 
q. Thus we now know that (20) can be solved for c in the form 

_ T 

(21) c ■ / j£(s)f(s)ds , 

0 

where £j(s ) is (for a given K(t, s)) a uniquely determined continuous^ kx n - 
matrix function of s, independent of f. Substituting this value of c in 
(18), we finally arrive at (8) with 

(22) G(t, s) * X(t)J5s2/ s ) + *(t, s) . 

The G(t, s) just introduced by (22) thus satisfies all the requirements 
of the theorem; so that the proof of the latter is now complete. 

Lemma 2 . The rank of the k * n-matrix is k. 

Proof, Suppose q is a k- vector such that 

(23) q 0lB T « 0 . 

It is enough to show that q must be the null vector, for this would mean that 
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the k rows of would be linearly independent, thus implying that its 

rank would be k. Multiplying on the right by X(T) we have qCtB^,X(T) * 0. 
And, since by (9) it is known that O^B^X(T) * - (Rb q X( 0) , we also have 
q0!P o X(O) ■ 0. Multiplying on the right by X(C)" 1 , we find that 

(24) qOlB 0 - 0 . 

Multiplying (24) and (23) respectively on the right by U(0) and l)(T) , and 
then adding, we obtain 


q <Jt[B 0 U(0) ♦ B t U(T)] - 0 


But, since, by Lemma 1, the n x n-matrix in the square brackets is just the 
identity matrix, we see at once that q OL* 0; and, since the rank of the 
k x n-matrix 01 is already known to be k, it follows of necessity that 
q ■ 0, as we wished to prove. 

Theorem 2 , The rank of H(s) - 0tB T X(T) X(s)' 1 for each value of s on the 
interval [0, T] is k. 

Proof , This is a mere corollary of Lemma 2, since the rank of X(T) is n 
and the rank of X(s)”* is also n. 

It should be noted that neither E nor G are uniquely determined by thi 
requirements of Theorem 1, However we now prove the following 

Theorem 3. Let !i(s) be any continuous n * k-matrix function such that 

T 

(25) C - / H(s) H (s ) ds 

0 

is a non-singular k * k-matrix. Then the G(t, s) of Theorem 1 may be 
determined uniquely in such a manner that 

T 

(26) / G(t, s) 11 ( s) ds = 0 

0 

Remark. A possible choice for H(s) is E(s)'. For, in this case, we have 

T 

C - / E(s) 5 (s) ' ds , 

0 
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which, since H(s) is of rank k by Theorem 2, is easily proved to be a 
positive definite matrix and therefore non-singular. 

Proof of Theorem 3 , Let Gg(t, s) be any kernel satisfying the requirements 

of Tneorem 1, and let R(t) be any n * k-continuous matrix function whose 
columns are admissible. Thus, if we define 

G(t, s) » C Q (t, s) - K(t) E(s) , 

It is seen that G(t, s) also satisfies all the requirements of Theorem 1. 

If we choose 

T 1 
K(t) = (/ G (t, s) li(s)ds) C' 1 , 

0 

it is easy to verify that (26) is satisfied. Moreover, this R(t) is 
admissible because G y ( t, s) satisfies Property III of Theorem 1. 

Let ij>(s) be an arbitrary continuous vector function, and then set 

-1 T 

f(s) * $(s) - H(s) t / E( z) <{>( zj dz . 

0 

We easily see that this f satisfies (7), because of (25). Hence, from (26), 
we prove that 


T T 

x(t) = / G(t , s) f ( s ) ds « / G(t , s) <j>(s) ds . 

0 0 

If G* (t , s) were a second matrix kernel satisfying the requirements of Theorem 
1 as well as (26), we could also write 

T T 

x*(t) = / C*(t, s) f(s)ds * / G* (t , s) 4> ( s ) ds . 

0 t) 

But x*(t) h x(t) because of the uniqueness statement under Pronerty II. Hence 

. T 

/ [G(t , s) - G* (t , s) J b(s)ds - 0 
0 

for arbitrary continuous <fr. It follows that G(t, s) = G*(t, s), thus 
completing the proof of Theorem 3, 

Lemma 3 , If B Q and arc n x n-mat rices and if (B Q , B^) is of rank n, 
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.hen there exist n * n-matrices I : Q and F^ such that 


(27) 

and such that / F ; 


Vo - Vt ’ 0 


(J;) “ ° f 


rank n. Moreover 


l : Q and F t are uniquely 


determined by these conditions up to a multiplication on the right by a common 
non-singular matrix. 

Proof . Since (B^, B^) is of rank n, a suitable permutation on its columns 
yields an n * 2n-matrix (C # D) , where C is non singular. Then 


, D) / C~ l DG|* 

v c ) 

0 ) 


DG’- DG = 0 regardless of the choice of the n * n-matrix G. 


Construct 


by performing the same permutation on the rows of JC D G j 

\- fi / 

as that which when applied to the columns of (C, 0) recovers (B Q , B^) . We 


thus have 


B Q F 0 “ Vt " <V / F 0 


(C, U) /c* 1 L) g\ • 0 , 


which establishes (27) for the constructed F's. Morcove 


rank oi 


/ \ ■ rank of / C * l) G\ , 

v't) \ ‘ G / 


r rank of / F fi \ 

V T / 


which is surely n, if, and only if, 


G is chosen to be non-singular. To see this, notice that any homogeneous 
linear relationship between the columns of G holds also for the columns of 
C* 1 D G. 

Suppose now we had two pairs l ; 0 , F^, and F^, F such that 
B 0 F 0 “ B T F T " B 0 F 0 “ B T F t * anc * the ran ^ s of both /f q \ and/ F Q \ 

Vt/ Vt) 

equal to n. Then the same permutation applied to the columns of (B Q , B^.) 
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and to the rows of / F n \ and / F \ sends the equations 


CB o'V/ F uV° and (B o 


T./ 


• v (;;) ■ 0 1 


into the equations (C 


. U) ^ Ej=0 


and (C 




-l r> 


These can be satisfied only by taking ii * C DC and 


h = C~^DG. Since both / F n \ and / 1 ; Q \ have rank n, we see as above that 


both G and G must be non- singular. Hence E * EK, where K * G * G, 
Since also G* - GK, we see that 


Permuting the rows of 


/ E \ and / E \ 

w w 


by the inverse of the permutation last 


mentioned leads to the result that 



where K * G* 1 G is non-singular. This establishes the last statement of the 
lemma. 

A continuous vector function x(t) defined on [0, T] will be 
said to be adjoint ly admissible if x(0)Fq * x(T)F^, * 0, where the F's 
satisfy the conditions expressed in Lemma 3. As a consequence of this leama 
it makes no difference as to which particular F's are chosen. A change in the 
F's only amounts to replacing the last written equality by the equivalent 
relation (x(O)Fq + x(T)F T ]K » 0, where K is non-singular. 

Theorem 4. The rows of E(t) are adjoint ly admissible. In other words, 

(28) H(0)F 0 ♦ S(T)F t « 0 . 

Proof . We note from (9) that ^B^X(T) « -0L*qX(O), Hence, from (14), 
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5(0) - CIb t X(T)X( 0)' 1 . - tflByXCOJXCO)* 1 • - OtB 0 , so that 
(20) 5(0) - - OLB 0 . 


Also, form (14), it is obvious that 

(30) 5(T) - ♦ OlB T . 


Hence, from (29) and (30), we see that H(0)F 0 ♦ 5(T)F T - - (J|B F Q - B^,) , 

which, by Lemma 3, must vanish. This establishes (28) and finishes the proof. 

Theorem 5. The rows of 5(t) constitute a complete set of linearly 

independent adjointly admissible solutions of the homogeneous linear system 

(31) at/dt ■ - C A(t) 

which is adjoint to the system dx/dt * A(t)x. 

Proof . It is well known that the rows of X(t)"* satisfy (31), Since the 

rows of 5(t) « 0tB^,X(T) X(t) " * , cf, (14), are merely linear combinations of 

the rows of X(t)’*, the rows of E(t) must also satisfy (31). In Theorem 2, 
it was proved that the k rows of H(t) are linearly independent; and, in 
Theorem 4, it was proved that the rows are adjoint ly admissible. Hence 
Theorem 5 will be completely proved as soon as it is shown that k is the 
maximum number of linearly independent adjointly admissible solutions of (31). 
This is most easily done by noting that the relation between a system and its 
adjoint is a reciprocal one. The adjoint of the adjoint of a system is 
obviously the original system. The same may be said about admissibility and 
adjoint admissibility. A vector function which is adjointly admissible 
relative to the adjoint boundary conditions is admissible in the original sense. 
Hence, if there were k' (> k) linearly independent adjointly admissible 
solutions of the adjoint system, Theorems 1-4, applied to the adjoint system 
with adjoint boundary conditions, would tell us that there would have to be at 
least k' linearly independent admissible solutions of the original system 
dx/dt ■ A(t) x. But we already know that there are just k of them. This 
completes the proof, 


III. BIFURCATION EQUATIONS 

Our method of solving the problem formulated in Section I involves the 
preliminary solution of the system of integral equations 
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T 

(32) x(t, c , y) - X(tJJc ♦ / G(t, s)f[x(s, c, y), s, u]ds . 

0 

for the unknown function x. The solution will depend not only on t but 
also on the parameter u and the k-vector c. Since G is bounded and the 
partial derivatives of f with respect to the components of x are continuous 
and vanish when x and y do, it is almost self-evident that the following 
system of successive approximations 

x Q ( t , c, y) « X(t)Jgc 


/ G(t, s)f[x .(s, c, y) , s, y]ds, m >, 1 
0 


must, if |y| and ||c|| are sufficiently small, converge uniformly to a 
solution x(t, c, y) such that x(t, 0,0) « 0. Further details of the proof 
are indicated in a previous paper, The Role of First Integrals in the 
Perturbation of Periodic Solutions, Annals of Mathematics, volume 63 (1956), 
pp. 535-548, especially p. 545. Although in this previous paper only periodic 
boundary conditions were considered, the proof in the present more general 
case (in so far as it concerns the existence of solutions) is exactly the same. 
The main feature is that f must be Lipschitzian with arbitrarily small 
Lipschitz constant, if |y| and ||x|| are sufficiently restricted, because 
of the above mentioned properties of the partial derivatives of f. Assuming 
then that equation (32) is solved and that the solution is continuous (as it 
would have to be because of the uniform convergence of the continuous 
approximations) , we are in a position to proceed to the next theorem. 

Theorem 6. Let x(t, c, y) , where c is a k-vector, be a continuous 
solution of (32) such that x(t, 0, 0) = 0. Then 

(33) B q x( 0 , c, y) + B t x(T, c, y) - 0 
and x(t, c, y) will satisfy 

(34) dx/dt * A( t) x ♦ f(x, t, y) - H(t)a , 

where the relationship between li and G is as in Theorem 3 and where the k- 
vector a is given in the following formula: 

1 T 

(35) a « a(c, y) * C" A / ~(t)f[x(t, c, y) , t, y]dt . 

0 
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Proof. Let f(t) « f[x(t, c, y) , t, y] - ll(t)a(c, y) 

T T 

/ 5(s)f(s)ds - / =(s)f[x(s, c, y), s, y ] ds 

0 0 

T . T 

- (/ 5(s)H(s)ds)C* 1 / 5(t)f[x(t, c, y), t, y ]dt 

0 0 

T 

* 0 since C * / E(s)H(s)ds as in Theorem 3, 

0 

Hence the condition (7) is satisfied by the above defined f (t) , Hence the 
second term on the right of (32) satisfies (34) and is also admissible. Thus 
(33) is also satisfied. The first term on the right of (32) satisfies 
dx/dt = A(t)x and because of (10) also satisfies (33). It is evident then 
that the sum x(t, c, y) must have the stated properties, 

It is clear from the theorem just proved that, if c * c(y) can be 
chosen as a function of y in such a way that 

(3b) a(c, y) * 0 , 

then the differential system satisfied by x(t, c(y) , y) is the system 
dx/dt =» A(t)x + f(t, x, y) obtained from (34) and (36). Since the given two 
point boundary condition is also satisfied because of (33) , it is seen that 
this x(t, c (y) , y) solves the problem formulated in Section I. 

Thus the problem has been reduced to the problem of solving the system 
(36) of so-called bifurcation equations. This is generally very difficult. 
But, in case the differential equations admit some first integrals the 
bifurcation equations can be put into another form, We investigate this 
procedure when the number of first integrals is just equal to k. 

Suppose that 4>[x, t, y] is a k-vector, each component of which is a 
first integral of the system dx/dt - A(t)x ♦ f(x, t, y) , Therefore, 

$ x (x, t, c, u)[A(t)x + f (x, t, y)] + $ t (x # t, y) = 0. 

Letting x ■ x(t, c # y) and then integrating we get 

T 

/ V x(t * c# c » [ A (t) x Ct, c, y) ♦ f(x(t, c, y), t, y)] dt + 

P 4> t (x(t, c, y) , t, yldt « 0. 

0 
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On the other hand we have by the fundamental theorem of calculus 


$[x(T, c , y), T, y] - $[x(0, c, y), 0, y] - 

T t T 

/ 4> (x(t , c, y)x(t, c, y)dt ♦ / 4> t (x(t, c, y) , t, y)dt. 

0 X 0 

Hence, by subtraction, we find that 


$[x(T, c, y), T, y] - $[*(0, c, y) , 0, y] - 
T 

- / * (x(t, c, y), t, y) [x(t , c, y) - A(t)x(t f c, y) - f (x(t,c,y) , t ,y) ]dt 

0 

T 

- - / U(t, c, n), t, \i) H(t) a(c,ii) dt by (34) . 

0 


We suppose that the components of <J>(x, t, y) are independent first 
integrals at least for the solution x(t, 0, 0) ■ 0, This amounts to assuming 
that the kx n - matrix 4» (0, t, 0) has rank k. It is natural therefore to 
dispose of the arbitrariness of the n * k - matrix H(t) , introduced first in 
Theorem 3 and used again in Theorem 6, by setting its transpose equal to 
4> x (0, t, 0), at least provided that 

T 

C • / E(s)H(s)ds 
0 

turns out, for this choice of H, to be non-singular. Since therefore 
* (0, t, 0) * H(t) * and since 

T 

/ H(t)' ll(t)dt 

0 

is obviously positive definite so that it is, a fortiori, non-singular, it is 
seen by continuity that the matrix 

T 

f d (x(t, c, y), t, y)H(t) dt 
0 X 

must also be non-singular for sufficiently small | | c I j and |y| . 
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It now follows from the last equality that the so-called "bifurcation 
equation", a(c, y) ■ 0 is equivalent to the equation 

4>[x(T, c, y) , T, y] - $ [x (0 , c, y) , 0, y] « 0 , 
at least if j [ c | j and |y| are sufficiently small. 

In the periodic case where $(x, t, y) = $(x, t ♦ T, y) and 
x (0| c, y) ■ x(T, c, y) , the above result shows that the bifurcation equations 
are identically satisfied. But this happy circumstance apparently need not 
occur in general. 
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SUMMARY 


The derivatives of a function of a function have been automatically 
computed applying the recursive Schlomilch-Cesaro formulation. Tables 
given for this purpose in collections of mathematical formulas were thus 
extended up to the eighteenth order. 

Then, the general expressions thus obtained were used to construct 
the power series expansions in the time variable of those powers of the 
radius vector which appear most frequently in celestial mechanics, 
namely, the force function, the force of attraction, and the derivatives 
of the force function with respect to the coordinates. These last expres- 
sions are given in terms of the radius vector evaluated at origin and the 
triplet of Stumpff's local invariants. The explicit expressions of these 
series were symbolically computed using the FORMAC language. 


INTRODUCTION 

The need to compute the Taylor series expansion of a function of 
a function arises frequently in celestial mechanics. Even though by the 
standards of this discipline the computation of such series is an ele- 
mentary problem, there is no doubt that its actual computation, if carried 
out by hand and to a high order, can be very tedious and time-consuming. 
This seems to be a case, therefore, where a computer programmed to 
perform symbol manipulation could relieve the scientists of this heavy task. 
We present here, first, a general algorithm given by Schlomilch and Cesaro 
to compute the derivatives of a function of a function in a recursive mode. 


* This report presents part of the work being performed under contract 
NAS 12-87, "Automatic Symbol Processing Techniques." 


19 5 


HIGH ORDER TIME DERIVATIVES OF POWERS OF THE RADIUS VECTOR 


This algorithm was programmed in FORMAC^^ and was used to 
compute the derivatives up to the order n = 18. Then, these general 
results were applied to obtain the expansions of r ^{t), where k=l,2, 3. 


FAA di BRUNO’S FORMULA 


Let 


(1 } u (t) = u[ v (t) ] 

be the function which we want to expand in Taylor series: 


( 2 ) 


u 


00 




00 



Then, the goal of the computation is to obtain explicit expressions for 

the coefficients c . For small n they are 
n 7 


(3) 


r du dv 

c i L d^ * dT J 

t=0 



and it is immediately evident that the amount of work required to obtain 
higher order derivatives increases rapidly with the order n. 

In spite of the complexity of this process, Faa di Bruno^ ^ has 
given an elegant and concise formula for the n^ derivative of a function 
of a function: 
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(41 = V n! dP u / 1. dv \ / J_ d^v j / 1 d^v \ k 

dt n ' 1 i! i ! -" k! dv P Ul dt J[2! dt ‘ ' 

where the summation sign extends over all the integer solutions of 

(5) i + 2j + ...+lk = n 

while, simultaneously, it is: 


( 6 ) 


i + j + . . . + k i 


The compendiousness of (4) makes it very adequate for theoretical 
work, for instance, to establish bounds, but it has little practical value 
for the actual computation of explicit expressions, particularly when high 
order derivatives are required. In effect, the most extended tabulations 

that can be used for this purpose, namely, those of the ir- and M -numbers 

3 [3l 

in table (24. 2) of a well-known collection of mathematical formulas , 

give these numbers up to n = 10, while in two different applications to specific 

[4] 

problems of celestial mechanics by Musen in perturbation theory and 
Szebehely^ ^ in the restricted three-body problem, it has been found 
necessary to compute these explicit expressions up to n = 8. It seems 
likely that in future investigations a much higher order of approximation 
might be required. 


SCHLOMILCH-CESARO’S FORMULA 

For n)10, then, there is no way of avoiding a lengthy and tedious com- 
putation. This appears to be a case in which a computer, adequately 
programmed, could take the burden of the long literal developments thus 
relieving the mathematician from this uninteresting task. 

While reviewing old and new mathematical literature on the subject of 
high-order derivatives of a function of a function, we have found a re- 
cursive procedure which adapts itself very well to being programmed in 
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FORMAC. We will call it the Schlomilch- Cesaro formulation 


because it can be found in the mathematical works of both authors . 
According to this method, 


[ 6 ] 


d) 


where 


( 8 ) 


d u __ ^ 
dt n V= ] 


D = 

V dv V 


D F 
v n, v 


and P ^ is a polynomial in the powers of 


( 9 ) 


d v 
dt 1 


Now, it is easy to establish the following recursion formula: 

{ V = 1 , 2, . . . , n+1) 

which should be supplemented by 


< 10 > P n+ 1 , V = 5T P n,v + *1 P n,V-l 


(ID 


P 


n+1 , 0 


0 , 


P 

n+1 , v 


0 


( V ) n + 1 ) . 


Noticing that for n = 1 there is only one polynomial 


( 12 ) 


p i,i = a i 


dv 

d~t ’ 


we obtain successively for the first five orders: 
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(13) 


P 2.1 

= 

a 2 

P 2,2 

= 

»? 

P 3, 1 

= 

a 3 

P 3,2 

= 

3a 

P 3.3 

= 

3 

a i 


1 a 2 


(n = 2) 


(n = 3) 


P, , 

= a 

4, 1 

4 

P„ . 

1! 

P 

P 

4, 2 

1 

P. . 

= 6afa 

4, 3 

1 

P. - 

m 

p 

4, 4 

1 


2 

l 2 


(n = 4) 


P 5,l 

= a 5 

P 5,2 

= 5a ,a h 
1 4 

P 5, 3 

2 

= 10a a . 

P 5, 4 

= lOa^a 

P 5,5 



2 3 
c 2 
5a i a 2 


(n= 5) 


This procedure has been used to generate ^ up to n= 18, 
at which value the numeric coefficients become too large to be computed 
in exact integer form. The polynomials for the first ten orders were 

[3] 

checked against the above mentioned tables and were found to be 

identical. As examples of the extension of these tables, we present 


* Jordan [7] gives the number of terms in all the polynomials P n ^ 
using Netto's notation T( /n). In the table on page 155 of reference [7], 
F( / 1 0 ) is given as 43, while our results and those of reference [3] 
indicate that the number of terms is 42. 
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in the appendix the polynomials P 


11, V 


and P . * 

12, V 


It is worth mentioning that these polynomials have a remarkable 

property: the sum of the coefficients is equal to the Stirling number 

M 

of the second kind . Thus, in the process of generating these 
polynomials, one also generates, as a by-product, the Stirling 
numbers. 


APPLICATIONS 


The method expressed in formulas (7) through (13) can be advan- 
tageously applied to computations frequently occurring in celestial 
mechanics. We show here with some detail its application to the 
construction of the time power series for the inverse square of the 
radius vector in the Keplerian motion along any conic section. In 
this case, r is a function of t through the Cartesian coordinates x, 
y, z. We take 


(14) u = 


r 2 (t) 


Then, we have 


(15) 


d V u v ( V + 1) ! 

, v ' " ' v + 2 

dr r 

.[ 8 ] 


Introducing Stumpff local invariants fi , a , € we have the following 

[9] 

expressions for the time derivatives of r(t): 


( 16 ) 


r o a 



* Polynomials of higher order ( n ( 18 ) can be requested from the authors. 
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3 

a 3 = V 3C? “ a/i " 3a€ > 

(16) a 4 = r o (-l5a 4 + 7a 2 /i + l8a 2 « -juc -3c 2 ) 

a c = r o (i05a 5 -60a^/i-150a 3 e +a^ 2 + 45ac 2 + Z4^ae ) 

where is the initial value of r. 

The polynomials ^ can, thus, be expressed in terms of the 
triplet /u, cr, c substituting ( 16 ) into (13). Then, taking into account 
(15) it is easy to obtain 



Expressions for terms of higher order are presented in the appendix, 
together with the corresponding terms for the inverse and inverse cube 
of the radius vector. All these expressions were produced by a single 
FORMAC program. The numeric coefficients were computed in exact 
form using rational arithmetic. These computations were truncated at 
n = 14, because overflow occurred while generating the numeric coef- 
ficients for the following term. These same computations have also been 
performed in floating point form; in this case it is possible to compute 
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the first twenty terms of these series. The expansions of any 
power of the radius vector could be obtained using this program. 

-2 

The time power series for r can be used to obtain directly 
the true anomaly through the integral of areas for any value of the 
time within the radius of convergence. The series for r* and 
r ^ can be applied to obtain the power series solution of the three - 
body problem in the time domain. 

The series for r ^ has also been applied 10 to the computation 
of ephemerides in the case of nearly parabolic orbits. 
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APPENDIX 


Table 1 
Polynomials P 


11, V 


p,, , = 

a , , 

11,1 

11 

p, , , = 

1 1 a a, _ 

11,2 

1 10 

P M , 

2 

55a, a rt 

11,3 

1 9 


2 9 


1 2 8 


38 


4 7 


5 6 


1 3 7 


14 6 


+ 13860LCP + 990a^a_ + 4620a_a_a, + 6930a _a.a_ 
15 27 236 245 


2 , 2 

+ 5775a_a. + 4620a_a_ 
3 4 3 5 


11,4 


3 2 2 2 

l65a,a 0 + 1980a a_a_ + 4620a a a, + 6930a,aa r 
18 127 13b 145 


2 2 

+ 17325a 1 a_a. + 6930a, a_a, + 27720a, a_a a 
124 126 1235 


+ 23100aaV + 6930aV + 34650a?aji + 15400a,cp 
134 25 234 23 


11,5 


330ajx_ + 4620afa_a, + 9240a^a_a_ + 5775a 
17 126 135 14 


+ 69300a^a a a + 20790a/i^a + 15400a^a^ 

1 ^ J 4 I L* Z> X J 


+ 69300a 1 ajQL + 34650a, aV + 17325a?a„ 
123 124 23 


11,6 


462afa, + 6930afa_a_ + ll550afa_a. + 46200afa_a^ 
16 125 134 123 


+ 34650a^ajx. + 69300afaja_ + 10395a, a 5 * 
124 123 12 
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P 


11,7 


P 


P 


P 


P 


11,8 

11.9 

11.10 
11,11 


P 


P 


12 , 1 
12,2 


P 


12,3 


462 afa + 6930a + 4620 + 34650 afajx.. 

15 12 4 13 12 3 

3 4 

+ 17325 a^a* 

330 afa. + 4620 a^a_a. + 6930 

14 12 3 12 

165 a ^ + 990 


Table 2 

Polynomials P^ ^ ^ 

+ 66 a 2 a l0 + 220 a ^ + 495 a 4 a g + 792 

2 

+ 462 a , 

6 

66 afa lrt + 660 a,a_a^ + 1980 a,ajx o + 3960 a.a.a. 

1 10 129 138 147 

2 , 2 
+ 5544 a,a_a z + 1485 a,a Q + 7920 a_a_a_ + 8316 a_a_ 

1 b o ^ o Z 3 7 c* 

2 3 

+ 1 3860 a-,a .a, + 9240 a_a, + 27720 a.a.a_ + 5775 a. 

246 36 345 4 


12 


12a j a n 
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P 


12,4 


P 


12,5 


P 


12,6 


P 


12, 7 


3 2 2 2 

220 0,^ + 2970 + 7920 a 1 ( ^ 3 a - 7 + 1 3860 0L 1 a ' 4 0L £ ) 

+ 8316 + 55440 a.a^a, + 83160 a.a>,a.a. 

15 1236 1245 

2 2 , 2 
+ 1 1880 ajCt^a^ + 55440 0^0,^ + 69300 0,^0^ 

+ 13860 + 83160 + 51975 

2 4 

+ 138600 aJX.a, + 15400 a. 

2 3 4 3 

495 afa_ + 7920 cfa a + 18480 aVa + 27720 afa. a 

1 o \ L i 1 3 o 1 4 D 

2 2 2 2 2 

+ 166320 a.a-ajx- + 41580 a.a_a. + 103950 aa.ci 

1235 126 124 

+ 1 38600 + 184800 a,a_a^ + 41 5800 a.ajxjx. 

134 123 1234 

3 4 3 2 

+ 83 160 ol, a a + 51975 gl.ql + 138600 a^a. 

1 2 5 2 4 2 3 

792 + 13860 + 27720 + 1 7325 a^ct^ 

+ 83160 a^a- + 277200 aVa a. + 61600 afcL 
125 1234 13 

+ 415800 afa^af + 207900 afaji. + 207900 a,A 0 

1 2 3 1 2 4 1 2 3 

+ 10395 a* 

924 era, + 16632 afa_a + 27720 afa.a + 1 38600 afa a^ 

16 125 134 123 

4 2 3 3 2 5 

+ 103950 a,a_ct + 277200 a aa + 62370 a.a 

124 123 12 
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P 


12 , 8 


792 + 1 3860 a^a 2 a 4 + 9240 cyx^ +■ 83160 

4 4 

+ 51975 o£c£ 


12,9 


12 , 10 


12,11 


12,12 


495 a®a 4 + 7920 aja 2 a 3 + 13860 

220 a^a 3 + 1485 

/ / 10 
66 


n 


n n ! 


Table 3 



1 


1 

— a 

r 


2 


3 


4 


5 


r (S" “ T 4 ° Z + I ' 2 - |^° 2 + T° 4 ) 


1 ( 9 35 3 152 7 3 u Z 

r V 20 € " a + T <a “ T 6 0 + J" a ~ 12 0 ° 


63 5 

T a 
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1 21 2 

7 V To “ 


315 


3 2 


720 ^ 16 € a 40 6 ^ + IT” 6 q2 


5 3 35 4 7 2 2 231 6 \ 

€ - —VO + + -yg- 0 j 


16 


80 


' 59 , 3 ^ 27 2 693 5 

r7 £ " J + 560 €MG + 77 €CT 


459 2 

+ IST 


315 2 3 

7T 6 G 


35 3 

16 € ° 


165 

16 


2 3 


Ma - a 


5040 


429 7 


1 / 2871 


195 2 2 


l — eju a — € u a + u 

V 128 448 ^ 40320 M 


3003 6 

— €G 


4131 2 2 27 2 2 3465 2 4 459 3 

' * + TT^TT € M + ~ l~a e a + T7777T € M 


896 


4480 


64 


4480 


315 3 2 35 4 3003 6 209 2 4 17 3 2 

32 € 128 6 128 AiCr + 128 M ° ~ 2688 ^ Q 

6435 8 \ 

uF CT ) 


1001 5 1199 2 3 

' <E jUO + 


16 


504 


360 


6435 _ 7 

32 


4345 2 3 


224 


e /ua - 


929 2 2 


6720 


e CJ — 


900 9 2 5 635 3 

~r~ A — € a — — - tuo 
64 504 M 


1155 3 3 

+ — € CT 


315 4 5005 7 1001 2 5 

■ € a + ' 777 * a 


128 


11 3 3 v 

+ — a n _ 

189 362880 


96 

12155 _9 
128 


■7 


10 


1 / 79079 6 

7 (-480- eMG 


1287 2 4 


128 


£ m a 


4939 

100800 


3 2 

c v CT 


3628800 


A* - 


109395 8 4433 2 4 28479 2 2 2 

e a - - ; t iua + ^ — 6 A* cr 


256 


64 


22400 
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2 

e 

3 

+ 

45045 2_6 

1397 3 2 

"T6F £ “ a - 

929 

3600 


128 * ° + 

67200 € 

15015 

128 

3 4 
■ « a 

127 4 

1008 6 ** 

3465 4 2 

+ 256 ' a 

63 ' 
256 € 

7293 

8 

fU o + 

29029 ,2.6 

143 3 4 

341 

64 

1920 ^ CT 

~ 432 ^ ° + 

1209600 

46189 

a 10 >1 




256 

a ) 

Table 4 




1 

— f M 



C n 

n ! 

n \ 2 J 

d t r 




n a 


-t(-' + 4 ° 2 


) 


a -h jo - 8 a 3 j 

r 

1 / e 2 2 

T ( Tz" " 12(0 + e 


19 _2 

12 


2 4 \ 

jua + 16 a J 


\ {- ff + 32c ° 3 - 6t2 ° + T- ^° 3 - £o - 320 


1/127 2 t 2 

2 i 20 ‘^ a ~ 360 ** 


80 £ a 4 - £ 2 /j + 24£ 2 o 2 


359 „ 4 79 2 2 , 6 \ 

- lA^ + 3"6F " ° + 64a ) 
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7 


8 


9 


10 


1 /' 1 523 3 17 2„ 5 697 2 

T r To -6 *' 0 + i¥‘ M 0 + 192ea + 28F eA,a 


80. V + 8 , 3 a + + — — a 

24 60 2520 


- 128a 7 y 


1 ( 27441 „4 9 22, e 3 _ 6 

“ 7 E + 20760 ^ - 448,0 

r 

35709 2 2 17 2 2 2 4 697 3 

~ -2240- E + TTIO E ** + 2406 ° + 2240 6 " 

3_2 , 4 6241 .6 ,867 2 4 319 3 2 

- 40, a +. 


256 a 8 ) 


1 / 16721 5 1781 2 3 421 3 

~Z k ” 64 6 ^ ° + ^4^° - 60480 e ^ a 


, , „ 7 , 100091 2 3 2753 2 2 2 5 

+ 10 2 4 ,a + -j-344- , - 5720 * M a ~ 672 e a 


17609 3 ,,.33 4 44923 7 

c /u a + 160« a - 10 € a + ■ - - yo 


4032 


192 

3679 2 5 2047 3 3 /A ... V\ 

192 M a + 12096 pa 181440 " 512 a J 


1 / 2227 6 166641 2 4 14429 3 2 


t( 


6/u a — 


4480 


e /u a + 


100800 


e /u a 


6 n 


1814400 


8 130073 2„ 4 

2304ea - — ■— — € Mo 

448 


286877 2 2 2 421 2 3 t 2 6 

+ -- € IU a - 7777777rr € A* + 1792 c a 


67200 


604800 


, 161377 3, 2 2753 3,2 3 4 17609 4,, 

+ — ZT-7T— € JU a _ 6 /Li _ 560 e a - -ttttt- « A* 


5040 


67200 


40320 


L 4 2 5 210029 8 7759 2 6 

+ 6°, a - , - vo + -jjg- Pa 
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130973 3 4 

120960 ^ ° + 


1279 

1814400 


4 2 
A* a 


1024 o 10 ) 


n 


Table 5 


c 

n 


\_ 
n ! 


d 

dt n 



r 


_L ( 3 . 15 _2 

3 ( 2 6 + 2 




1 ( 15 

3 V 2 e a 
r 

T ( 


35 3 




€ 105 2 15 2 

8* " — 6C + T € 


23 

- -Trtuo + 


315 4 


_1_ ( 2J_ 

3 V 10 


, 315 3 

c M a + € a 


105 2 3 u 

~ € ° + U ^ a ~ 40* 


693 5 

“ a 


1/107 2 

T (— 


240 


2 3465 4 72 

M ~ — €a ' 20 £ ^ 


945 2 2 35 3 281 4 19 2 2 

+ — 6 0 ‘ l6 € ' + 48 M a + 16 - > 


16 


3003 6 \ 

a ] 


479 nrr 3 x 123 1 , 9009 5 589 2 

. e a + 77-7- c /j a + —77 — t a + 77-7 e 


560 


16 


112 


3465 2 3 , 315 3 


16 


c a + 


16 


1619 

16 


A* a 


127 2 3 

Li O 

48 M 
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6435 7 \ 

+ 1680 Q 16 a / 


/ 28437 

4 5967 

2^2 , € 

3 

; V 128 ^ 2240 ' 

M Q + 1 3440 


45045 6 

33801 2 

2 

123 2 2 


32 6 Q 

896 € M 

° + iiSo * * 


45045 2 4 

589 3 

3465 

3 2 

315 4 

64 6 ° 

+ 896 € M 

32 ' w T 

128 € 

34913 6 

1593 2 4 

383 

3 2 

109395 

128 “ CT 

+ 128 " a • 

- u o + 

13440 M 

128 

/ 46925 

5 881 

2 3 

253 

3 

r 64 

‘MO + -4g- 

€ j U O 

20160 

€ /U 0 


109395 

32 


7 12385 2 3 5717 2 2 ^ 

€ 0 + ~ 672^ € ^ a 


64 


135135 2 5 13915 3 „ 15015 3 3 3465 4 

64 € G 1344 € M 32 € ~ 128 € 


4 

U 


+ . ?253 2 5 3 3 _ 

64 ^ 192 M 1728 ^ 120960 


10 -V 


230945 9 \ 

128 G ) 

1 ( 1429381 


640 


6 60161 2 4 11863 3„2 

€Ma _ — — € fj. a + € H a 


640 


40320 


€ 4 

2078505 _8 

104575 2. ^4 


1209600 M 

256 € 

128 ‘MO 


2215 2 2 2 

253 2 3 

765765 2 6 


224 * M a 

201600 € M 

+ 128 ' a 


32017 3 2 

5717 3 2 

225225 3 4 

2783 

384 tM ° - 

67200 6 ** 

128 € a 

2688 




45045 4 2 693 5 111725, _8, 158291.2^6 

+ "756“ € ° - T56 € --^ Ma+ -W Ma 


787 34 

320 ^ ° 241920 


307 ^2 + 96^*9 ,10 ; 
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ON THE LONG-PERIOD BEHAVIOUR 
OF CLOSE LUNAR ORB ITERS* 

By J. Vagners** 

University of Washington 
Seattle, Washington 

SUMMARY 

******* 

In this paper semi-analytical results are presented for the long-term 
behaviour of close lunar orbiters. The Moon's aspherical gravity field is de- 
scribed by spherical harmonics through J 4>4 and the Earth is idealized as mov- 
ing in a circle in the lunar equatorial plane. After the short- and medium- 
period terms have been removed from the Hamiltonian, the long-period motion is 
analyzed from equi- energy trajectories in the eccentricity- argument of peri- 
center plane. Stationary, or resonant, solutions to the slowly varying equa- 
tions of motion are determined numerically and the results presented as varia- 
tions of (critical) inclination and eccentricity with semi-major axis. These 
computations are based on two sets of preliminary values of the lunar harmonics 
J n,m recently published by sources in the United States and the Soviet Union. 
Representative equi- energy contours are presented to illustrate the evolution 
of the long-term motion as influenced by orbital inclination, semi-major axis 
and the parameters Jn,m* Many stable orbits, in the sense of not impacting the 
Moon, are found even for high inclinations. 

Introduction 

In the analysis of near-earth satellite motion the dominant perturbation 
is the oblateness J 2 , of 0(l0 - ^)^, with all other (gravitational) perturbations 
of at most 0(l0" 6 ), or second order in J 2 . As can be shownC 1 ), to first order 
the Hamiltonian contains only short-period (periodic in the mean anomaly) and 
secular terms. These characteristics allow an analytical solution by succes- 
sive approximation wherein, to any desired order of accuracy, the equations of 
motion are reduced to trivial quadratures. 

This situation changes significantly when we consider the motion of a close 
lunar orbiter. The ratio of the mass of the Earth to the mass of the Moon is 
roughly 81 to 1. Consequently, when the orbits of a lunar orbiter and Earth 
orbiter are geometrically equivalent, the perturbation of the Earth will affect 
the lunar satellite 81 2 times stronger than the Moon will affect the orbit of 
the Earth satellite. From the early lunar orbiter flights it is known that the 
oblateness of the Moon is of 0(l0 -4 ) and that the higher harmonics J n m in the 
lunar potential may be as large as 0(lCT 5 ). Hence, neither the Earth ’ inf luences 


This work was performed in association with research sponsored by the National 
Aeronautics and Space Administration under Research Grant NsG 133-61. 

**The research was performed while the author was at Stanford University and is 
a part of his Ph.D. Dissertation. 

^ The central force field is taken as 0(l). 
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nor the higher harmonics are of second order in J 2 * fact, as the semi-major 

axis of the satellite increases above two lunar radii, the Earth influences ex- 
ceed, and eventually dominate, the oblateness effects. The result is that the 
variable part of the (slowly- varying ) Hamiltonian is factored by small para- 
meters of (roughly) the same order of magnitude so that successive approximation 
schemes fail. Nevertheless, several of the approximation steps may still be 
taken and then many significant features of the long-term motion extracted with- 
out actually integrating the equations of motion . 

In modeling the lunar orbiter problem, I will make the following assump- 
tions : 

. The Earth moves in an apparent circular orbit in the lunar equatorial 
plane; the actual apparent orbit is inclined about 6°41' (±9') to the 
lunar equator and has an eccentricity of about 0,055. 

# The Earth is spherically symmetrical and hence can be represented by 
a point mass. 

. The lunar gravity field is described by spherical harmonics through 
J 4 4 ; at the time of this paper no estimates of the higher harmonics 
were available. 

. Solar radiation pressure and solar gravity field effects can be ne- 
gl . cted . 

. The physical librations of the Moon can be ignored. 

Since for close orbiters the neglected terms contribute small 0(l0“ 7 ) perturba- 
tions or less, the following analysis should provide the correct gross behaviour 
The only exceptions might prove to be the third and fourth assumptions; radia- 
tion pressure probably significantly affects the results if one considers orbit- 
ers of very high area-to-mass ratio, for example dust. The other, more general, 
invalidating factor might be the higher zonal harmonics J n , n > 5. 


Long Period Equations of Motion 


In keeping with the assumptions on the apparent Earth orbit, I will take 
the reference plane to be the lunar equatorial plane with the zero-meridian to- 
ward the Earth. In terms of luni-centric coordinates the potential field of the 
Earth is expressed as follows 



(i; 


where (-ijr is the gravitational constant of the Earth, r^ is the (constant) luni- 
centric distance to the Earth, s is the direction cosine of the luni-centric 
angle between the satellite and the Earth, and P 2 is the second Legendre poly- 
nomial. The direction cosine s 'can be expressed in terms of orbital elements 


s = co s ( u + ft - fig) - 2 sin 2 i/2 sin u sin(ft £ - ft) ( 2 ' 

where ft is the longitude of the ascending node, ft £ is the mean longitude of the 
Earth, u is the central angle and i the inclination. 
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Following the device of Kozai 
ables as follows: 


( 2 ), 


I define a set of modified Delaunay vari- 


L 


G 


L y/l 


2 

e 


H = mean anomaly 
g = argument of pericenter 


(3) 


H = G cos i 


h = ft - ft^. 


where $,g,h are canonically conjugate to L,G,H and e,a are the eccentricity 
and semi-major axis respectively. The associated Hamiltonian is 


2L 


I 




n,m 


(4) 


incorporates 


where n E is the (apparent) Earth mean motion. The term 

all of the lunar gravity anomalies described by the gravity coefficients J R 
The complete explicit form for all n,m is not of direct interest here; it may 


be found in Refs. 3 and 4. Under the assumption that the Earth orbit is circular 
and in the lunar equator, no angular variables other than i,g,h appear in <h. 
Inclusion of Earth eccentricity and inclination would introduce angles such as 
the node and the argument of pericenter of the Earth orbit. 


In general, the Hamiltonian (4) will contain the following types of terms: 
a) secular terms, depending only on L,G,H due to even zonal harmonics and the 
Earth, b) long-period terms (periodic in g) due to all zonal harmonics and the 
Earth, c) medium-period terms, with periods of a month or fractions thereof, 
arising from the tesseral harmonics and the Earth, and finally d) short- or l- 
periodic terms from all sources. For the present problem the short- and medium- 
period terms may be removed from the Hamiltonian via the von Zeipel method 
The process will not be given here, since it is algebraically involved, although 
straight-forward, and may be found in Oesterwinter 1 5 ) and Giacaglial 6 ) . It may 
be noted here, however, that the referenced results are incomplete in view of 
the presently estimated magnitudes of the higher J nm LO(lO ** ) ] . In the quoted 
papers only the J 2 , J 2 2 and Earth short-period and’only J 22 and Earth medium- 
period terms were specifically determined. For the present purposes, I assume 
that all of the short- and medium-period terms have been removed leaving only a 
slowly varying Hamiltonian. 


In the literature, the roles of the canonical coordinates and momenta are 
interchanged by considering the negative of the Hamiltonian, denoted by 
and hence referring to as the Hamiltonian. I shall adhere to this (by now; 

well-established convention. With the understanding that all elements are 
slowly varying, the Hamiltonian can be written as (4J: 


15 


(■-$)(•-$)] 


; 3 h 2 ' 

\ + 


k " G 2 , 

I + 

16 4 

6 r 

3 

64GV [ 

(35 

4. 
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2 2 2 3^<L 2 

-5 (l - 7 cos i) e sin i - r~— ■ e sin i (l - 5 cos i) sin g 

J 8G 5 L J 


2 

sin g 


( 5 ) 

In Eq. 5 the Kepler elements e and i have been used as a convenient shorthand 
for the more involved canonical counterparts and the mean lunar radius R = 
1738 km has been taken as the unit of length. 

Since ‘i* is independent of time, it is a constant as are L and H since 
neither & nor h appears in Thus we have a single degree of freedom 

system described by the differential equations 


15 


7 3 

32 G L 


2 2 
e sin i 


(1 


2 4 

15 n E L 

2 

8 M* 


S)H) 


W 

5T 


( 6 ) 


Since the integral It * = constant is also known, the problem, in principle at 
least, is solved. The known integral could be used to eliminate one of the 
variables and hence g,G would follow by quadrature. However, it can easily 
be ascertained that the solution cannot be found in terms of known functions. 

In the von Zeipel treatment (l) of a near-earth orbiter whose inclination is not 
to close to critical, the long-period fluctuations were removed from If* at 
this point by a (further) canonical transformation and the problem reduced to 
trivial quadratures. Such a procedure fails in the present case as both the 
secular and g-dependent parts of the Hamiltonian have (essentially) an 0(l0 -4 ) 
multiplier. This can be deduced from (b) by noting that the multiplier of the 
Earth contribution, containing secular and cos 2g terms, has the factor^ (n^/ 
n') 2 after division by n 2 /2L 2 ; this factor is a quantity of roughly the same 
order as Jg. 


It proves convenient for numerical calculations to introduce the parameter 
rj and constants of the motion defined by: 


G /, 2/ 
T) = - = (l - e ) 


a H 2 , 2. 

P = — = (1 - e ) cos l 

L 2 


( 7 ) 


a 2 = 7 ^ 

2a 


“3 --S 
4a 


fe) ! 

fey 


* 1 JLi * 

n = <1 , the satellite mean motion. 
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3 J 4 /nl \ 2 
“ 4 ‘ 16a 4 \”E / 


which are dimensionless since R a has been chosen as the unit of length. Com- 
bining the constant parts (independent of g and G) with 5F* and some alge- 
braic manipulation yields the ’’energy integral" in the form: 


= cyrf 3 (l - 3 ^' 2 ) + |(5 + 3(3 - 6t} 2 ) - \ O^rf 7 \ [ 35(1 - (V 2 ) 


-40 ( 1 - Pif 2 ) + 8 j (5 - 3T| 2 ) - 5(1 - 7 £t}" 2 )(1 - r] 2 )(l - 

-a n' 5 (i - r\ 2 ) 2 (i - Pn’ 2 ) 7 ' (i - s^' 2 ) sin g - — (i - n 2 ) 


. (l - Prf 2 ) j^f a 4 T l” 7 (! * 7 ^~ 2 ) + l| sin 2 g 


1^ sir 


0) 


C - 


2 4 
n E L 


J - 


The form chosen here is such that if Qlg and 0!^ are taken as zero, one re- 
covers (essentially) the results of Kozai( 2 ). 

The equations of motion corresponding to this energy integral are deter- 
mined by differentiation of (5) according to (6): 


. 1 • 1 dSF 

* b l g= l 5T : 


8 L 


\{i - r\ 2 ){i - Pn" 2 )] (i - " 2 ) cos g 

a n 


-5(l - T] 2 ) (l - Prf 2 ) 


„ 5 7 
4a T) 


(l - 7ptt‘ ) + 


2 2 

2, He a 


sin 2g 


( 10 ) 


ci-* 3 

55” 4 L 


J, 2 2 V 2 

TJ (n 2 -se) + -5-n- 

a T) <X 


5J 


5 

32a r| 


4 ^- + 7 ( 2 P + l) 1 ! 4 


- 63(p 2 + 2$)r\ 2 - 231p 2 J + 


4 5 
2a T) 


- \ h 2 - 5?) [d - n 2 )d - f*f 2 ) 
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(ti - sp) 
2 
n 


al--P 

i - n 2 


+ R l IlT l - 15 ^ 


ti 2 - P 


5J 


sin g + < 


_ 2 2 
5n a 


. j^ 6 - (56 (3 + 7)n 4 + (63(3 2 + 723)n 2 ‘ 77£ 2 J + — (P - T} 4 ) j sin 2 g| 

(11 J 


Geometrical Solution 


Since it seems highly unlikely that a general solution in terms of known 
functions can be found to the system (lO) and (ll), I will attempt to obtain the 
general characteristics of the motion indirectly. From the known energy inte- 
gral (9) it is possible to construct, with the aid of a computer, equi-energy 
contours in the r\ 2 - g plane for given values of the semi-major axis a and 
(angular momentum) parameter {3. The form of the contours is also strongly 
dependent, for low a, on the values of the harmonic coefficients J 2 , J3 and 
J4 ; as noted earlier these values are not very well known at the time of this 
writing . 

From preliminary analysis of the U.S. lunar orbiter data^ the following 
coefficients are available 


J 2 = -2.3691 X 10 

J 3 = 3.366 X 10' 5 

J 4 = -1.368 X 10' 5 (12) 

( 7 ) 

Another (preliminary) set, as determined from the U.S.S.R. Luna 10 flight^ 
is 


J 2 = -2.06 X 10 

J 3 = -3.62 X 10“ 5 

J 4 = 3.33 X 10' 5 (13) 

The agreement as to the value of J 2 is not too bad; the U.S. value is closer to 
that determined from the physical libration of the Moon (-2.41 X 10~ 4 ) than the 


•^Results presented at Guidance Theory and Trajectory Analysis Seminar, NASA 
Electronics Research Center, Cambridge, Mass., June 1967, by W. T. Blackshear, 
et al. of Langley Research Center. 


220 


LONG-PERIOD BEHAVIOUR OF CLOSE-LUNAR ORBITERS 


U.S.S.R, value. As for J 3 , the signs are different, although the magnitudes 
are fairly close, as is also the case with J 4 - In view of these facts both 
sets of coefficients were used in determining typical "phase plane” (t^ I 2 , g) con- 
tours, thus in some sense indicating the sensitivity to, as well as dependence 
on, the values of the J 3 and J 4 harmonics. 

With the motion constrained to a given energy curve, by plotting equi- 
energy contours we can obtain the qualitative and quantitative long-period be- 
havior of g and eccentricity (or inclination). The time history on the con- 
tours can be obtained very rapidly via numerical integration of (lOj and (ll) 
for any contour of interest. 

Before presenting some representative contours, note that for each value 
of the semi-major axis there is a maximum allowable eccentricity in order that 
the radius of pericenter remains greater than the radius of the Moon. This con- 
dition is simply e^x = a ~ 2: • or in terms of T] 2 


2 2a - 1 

^crit " 2 

a 


(14) 


For convenience, this relationship is shown graphically in Fig . 1. All equi- 
energy contours in the q 2 - g plane are terminated at r) 2 rit since the cross- 
ing of the line b y any contour implies lunar surface impact. 

In Figs. 2 through 7 typical rj 2 - g contours are presented; Figs. 2, 3, 

4 were obtained using the Langley J n values, Figs. 5 and 6 were obtained for 
the U.S.S.R. values. In Fig. 7 the set of elements corresponding to the U.S. 
Lunar Orbiter IV initial elements was chosen and the corresponding trajectories 
were also computed via numerical integration of the long-period equations of 
motion. The numerical integration points are given at 30-day intervals to in- 
dicate the relative rates of Tj 2 , g on various parts of the contours. The 
maximum possible lifetime available for the parameters of Fig. 7 is roughly 
3.5 years; the appropriate contour is indicated on the figure. 

Inspection of the contours reveals several major features: 

, Stationary points with surrounding librating orbits where g is 
constrained to vary between (some ) limits . 

. Circulating orbits where g increases through 2it . 

. Impacting orbits resulting from either circulating or librating 
orbits . 

From these observations and Figs. 2-7, one can deduce that stable, or surviving 
orbits are possible even for high inclinations if the initial conditions are 
chosen (sufficiently) near one of the stationary solutions. 

I will, therefore, defer sane additional comments concerning the q 2 - g 

contours until the nature of the stationary solutions has been investigated in 
the next section. Only stationary points for libration orbits are depicted in 
Figs. 2-7; one might ask if there are any other types of stationary solutions 
possible. Stationary, "saddle- type" points are also possible, but were found to 
exist only for r|2 less than r] 2 rit , i* e *» under the surface of the Moon. An 
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example of contours illustrating the different types of stationary points and 
contours is given in Fig. 8 . In this figure only the interval 0° < g < 90° is 
shown; the contours for -90° < g < 0° are similar but not symmetrical about 

~ _ n 0 

to “ ^ • 

Stationary Solutions 

The stationary solutions are defined by r) = g = 0 and correspond to re- 
sonances in the critical inclination sense, i.e., commensurabilities of the 
anomalistic and nodal periods. In the near-earth satellite problem where 
dominated all other perturbations, and hence the nodal regression rate, one 
found resonance at the critical inclination of ±63.4°. As the semi-major axis 
of the orbiter is increased and the luni-solar perturbation effects increase, 
coupled with the decreasing rate of the node due to J 2 , one finds additional 
resonances emerging. For example, in an investigation of high- eccentricity , 
sun-perturbed orbits around Mars, Breakwell and Hensley (®) find eleven critical 
inclinations (resonances), all above i = 40°, The analysis of Breakwell and 
Hensley is inapplicable in the present case since it assumed that J 2 of Mars 
dominated the Sun perturbations, whereas in the lunar orbiter problem J 2 , J 3 , 

J 4 and (ng/n')^ are a n roughly the same for close lunar orbiters. Character- 
istic of these resonances are the associated large fluctuations in eccentricity 
and the inclination. 


Digressing, we note that the physical reality of such resonances was per- 
haps first verified by Explorer VI, launched August 1959, whose elements were 
i = 47.1, 


= 4.35 R, 


1 , - - “U, « = 0 . 76 , placing it very close to the critical in- 

clination''® of 46.4°. The original estimates of Explorer VI lifetime were 
roughly 200 years without accounting for luni-solar effects; when these pertur- 
bations were included, the lifetime dropped to 2 years (’). This lifetime 
estimate was spectacularly verified by decay before July 1961. 


Analytical or semi-analytical work on high- inclination , high-eccentricity 
resonance phenomena has been almost nonexistent. Classical celestial mechanics 
essentially ignores this problem since nearly all such orbit problems in the 
solar system, excepting some asteroids and comets, are blessed with small in- 
clination and eccentricity. Apparently some numerical-analytical work was done 
by Liouville and Halphen, Musen C® ) C 10") and Smith(ll) have recently revived the 
methods of Halphen, with modifications by Goriachev of U.S.S.R. , and applied 
them via computer to the Earth satellite problem when the luni-solar influences 
are (relatively) large. In his initial studies, Musen found that the eccentric- 
ity oscillated rather strongly in a large interval, going from large values to 
zero and increasing again with accompanying inclination changes of as much as 
20° (cf., Figs. 2-7). Such pulsating behavior of the eccentricity was also noted 
by Kozai(l 2 ) in his studies of high inclination and eccentricity asteroids per- 
turbed by Jupiter. 


Let us return now to the lunar orbiter problem and the question of station- 
ary solutions, or resonance points. From (10) one can readily see that T) is 
always zero for cos g = 0 or g = ±90°, for all values of x\ 2 and p. (To 
help in relating statements about t] 2 , (3 to orbit geometry, one can think of 

T| as "eccentricity" and P as "inclination.") For a resonance condition we 
must have g = 0 subject to sin g = ± 1 ; this condition can be written as 


2 [(1 - n 2 )(n 2 * P) J 


1/2 


5 J 

J 9 a 2 T) 6 (T ) 2 - 5P) + 2 xt ) 4 - C 
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+ 5 [r S + x(e ' ^ 

where 

C = n 6 + (143 + 7)t) 4 - (633 2 + 126P)il 2 - 23l£ 2 

C 2 = 5n 6 - (56P + 7)r) 4 + (63(3 2 + 72P)t) 2 - 77p 2 (l6) 

C 3 = 4T]6 " ( 35 ^ + 5 ) r l 5 + ( 35 ^ 2 + 41 ^) 1 1 2 “ 4 °P 2 


± J 3 ar! C 3 = 0 


( 15 ) 


2 7 4 


Note that as e- 0, T) 2 -* 1 and tL„ behavior of the zeros of (15) is dominated 
more and more by the J 3 term. Conclusions concerning the behavior at e = 0 
cannot be drawn from ( 10 ) since the variable g becomes undefined at e = 0; 
characteristics of the zero and near-zero e behavior will be examined later. 


The next possibility is that cos g ^ 0 and hence x\ = 0 implies: 


-2J 3 aT} 3 (T] 2 - 5p) 


5 [d - n 2 )(n 2 - p )] 1/2 [j 4 (ti 2 - tp) + 4x] 


(17) 


for e 4 0 and i ^ 0, (tj 2 = p). Substituting ( 17 ) is g = 0, we find the 
resonance condition 


J 4 (l - 7P) 


n 2 

x] (1 * 


t» 2 )(ti 2 - P) 


J 2 a 2 t) 6 (T) 2 - 5P) 


2xri 


!fi c 

32 1 


- [j 4 (n 2 - 7P) + 4xji 


2 T) 6 (h 2 - 5p)c, 


2 6/2 

a tj (“H 


53)‘ 




+ 4x(p - n 4 ) = o 


(18) 


In relations (l5) and (l8) we have the parameters a and 3 and ask for 
the positive roots r\2 < l (if any). The implication of (15) and (18) is that 
we no longer have the critical inclination problem in the usual sense, for now 
the resonance or stationary solution depends on eccentricity as well as on in- 
clination for a given semi-major axis. In addition, the argument of pericenter 
is fixed at g = ±90° or as determined from ( 17 ) . In the usual critical in- 
clination problem, as in the case of near-earth satellites, all perturbations 
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except J 2 are second order and hence the stationary solution, to first order, 
depended only on inclination. 

Since ilie resonance conditions are high-order polynomials in r \* , they 
must be solved numerically. Prior to discussing the numerical results, however, 
let us first examine some special cases of the resonance conditions: a) when 
the eccentricity is close to zero and b) when the semi-major axis is large. 

As the semi-major axis grows, the Earth influence increases, until at some point 
the J n effects may be considered "second order" and hence dropped from the 
equations. In doing so, the (long-period) equations of motion reduce to 


15 

^ - ' 8L h ' 

n 2 )(i 

- 0Tj 2 )n^a 2 sin 2g 

(19) 

,22 

• 3 v r 

* ~ 3 

3Lt| 1 

(5p - 

Tl 4 ) - 5(0 - T) 4 )cos 2g j 

(20) 

Thus T] goes to zero at g = 0° , 

+90° 

2 

and at T) - 1, 0. For 

n 2 = i, P 


the eccentricity and inclination are (respectively) zero and we encounter loss 
of definition of g. For g = 0° the only solution to g = 0 is for r| = 0 
which corresponds to a parabolic orbit of e = 1. For the remaining possibili- 
ties of g = +90° we find g = 0 for 0 = 0.6r|4, a result in agreement with 
the findings of Kozai,(l 2 ) Lidov,(l3) Williams and Lorell.(l4) 

In order to determine the small (and zero) e behavior, introduce the 
(non-singular) variables 


A = e cos g 
B = e sin g 

The governing differential equations for these variables are 


( 21 ) 


A = 


_ nu 


cos g - Bg 


B = - m 


sin g + Ag 


( 22 ) 


Substituting ( 10 ) and (ll) for T| and g, simplifying and retaining only terms 
up to e 2 , we find 


A = iT (1 “ e> l/2 U - 50) + 


5 J 

— “r (8 - 880 - 1 19p ) 
16a 


2a (1 - 0) 


2 2 

n E a J 2 

— — (3 - 50; - (l - 50) 
^ a 3 


(5 - 410 + 400 ) 


1/2 
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Jg(A 2 + B 2 ) 


• . A. - rt * ~ ” ’i 


(23) 


2 2 

, w \J 2n a 5J 


(4 - 56P - 147p ) 


J 3 B 


4 TTo (* - « IP + 40P ) 

2a 4 ( 1 - P) 1 / 2 i 


(24) 


For e = 0 the only possible stationary solutions are for £3 = 1 and £3 = 0.2 
corresponding to inclination of i = 0° and i = 63.4° respectively. Note 
that for small e, say e < 0.1, and small (3 (£3 < 0.5) we can integrate (23) 
and (24) by successive approximation with Jge 2 considered to be "second order." 
The integration technique parallels that of Breakwell and Hensley( 8 ) except 
that in the present case we no longer have the symmetry of coefficients and 
variables . 


In the general case, when the eccentricity may take any (allowable) value 
we determine the stationary solutions from ( 15 ) and (l 8 ) via computer. This 
was done for the two sets of J n values quoted in Section III and the results 
shown in Figs. 9 and 10 for the Langley and U.S.S.R. values respectively. Two 
major divisions exist in these figures; the solutions of (l 5 ) for g = ±90° 
and of (18) with g determined from (l7), the former appearing in the upper 
left, the latter in the left center of the figures. The loci of critical in- 
clination and eccentricity for a (representative) range of semi-major axis 
values are given, terminating at the maximum allowable value of the eccentricity 
for the semi -major axis in question. For the g = ±90° loci the +90° loci are 
shown in dotted lines, the -90° loci as solid lines. As noted earlier, these 
tend to the two zero eccentricity solutions of i = 0° and i = 63.4° as 
e - 0. Identified by its equation, cos 2 i = 0.6(l - e2), is the Earth-only 
curve which is the limiting locus for high-a orbits; the locus for a = 5 is 
indistinguishable for non-zero e from the limiting curve on the scale shown. 
Superimposed on the loci for g ^ ±90° in the upper left corner are contours 
of constant g values. The upper left corner region ceases to exist for a > 

2 since then the stationary solutions move "under the lunar surface,” i.e., 
they exist for eccentricities greater than the maximum allowable e. 

An interesting feature can be observed in the evolution of the ±90° loci 
as the semi-major axis is decreased from, say 5. On the limiting curve the loci 
for ±90° coincide. As one decreases the semi-major axis the plus and minus 
curves separate due to the increasing influence of the odd harmonic J 3 (multi- 
plier of J 3 term is sin g). The critical inclination for a given eccentricity 
is greater for g - +90° than for g = -90° if J 3 > 0 and vice versa if 
J 3 < 0 (cf. Figs. 9, 10), Furthermore, in the case of J 3 > 0, the maximum 
inclination attainable for g - -90° by decreasing a peaks out at about a = 
1.4, then decreases as a -► 1. On the other hand, for g = +90°, the maximum 
inclination increases monotonically and goes to 63.4° as a -* 1. 


The situation is modified when the J 3 and J 4 coefficients change sign. 
(From the present results it is difficult to estimate the influence of the 
slight J 2 value decrease, but presumably the majority of the changes are due 
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to the (significantly) different values of Jg and J 4 .) The ±90 loci still 
tend to separate with decreasing semi-major axis but now the curves tend to 
draw together again as a decreases below roughly 1.7, say. This is due to 
the proximity of the low- a curves to the inclination 63.4° for which the 
J 3 influence is minimum. The inclination for a stationary solution at a given 
semi-major axis and eccentricity is larger for the coefficients given by the 
Luna 10 flight than for the Langley coefficients for all values of a off the 
limiting curve. 


The last point to consider is the nature of the stationary solutions: Are 

they stable or unstable? Stability in this context means bounded variations of 
both and g for motion near a stationary point; furthermore, the variation 

of T ) 2 must be such as not to exceed ^crit* This question can be answered 
from analysis of the Hamiltonian and its partial derivatives at a stationary 
point. Since variation of g and T} 2 must be bounded near a maximum or a 
minimum of the Hamiltonian, the stationary points corresponding to the minimum 
and maximum are stable. The saddle points of the Hamiltonian will give the un- 
stable stationary points. These conditions are easily checked on the computer 
at the time of computation of the stationary solutions from the value of the 
determinant 


aV 

aV 

Sg 2 

SgcKj 

aV 

aV 

dGdg 

ao 2 


(25) 


From the canonical nature of the variables g,G, we have A as 


56 Sg 

5g 3 g 

Sg 

3g ~ 3g 


so that stable stationary solutions are characterized by 


A > 0 and 


Sg 

Si 


>0 or 


ctt 

Si 


(26) 


(27) 


and unstable stationary solutions by 

A < 0 (28) 

All of the stationary solutions of Figs. 9 and 10 were found to be stable, cor- 
responding to maxima of the Hamiltonian for g ^ ±90° and minima for g = ±90 • 
The "saddle points" of the Hamiltonian, corresponding to stationary solutions 
where equi-energy contours in the rj 2 f g plane intersect, presumably exist 
only for T } 2 < ^ 2 ^, 
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Conclusions 

In view of the results of the preceding section some additional comments 
on the T] 2 - g contours of Figs. 2-7 are in order. To aid in understanding the 
evolution of the curves one can plot the locus of stationary T] 2 versus £3 
for a constant semi -major axis: this is done in Figs. 11 and 12 for a = 1.7 
and a = 3.0 respectively for the Langley set of J n 's. The figures pertinent 
to this dicussion then are 2, 3, 4 and 11, 12. 

As £3 increases from zero, a stationary point develops at the appropriate 
value of g (Fig. 9) and moves "downward” in the decreasing q 2 direction, 
disappearing under the lunar surface. Increasing £3 still further, we encounter 
the g = +90° stationary point "rising" from below "n^rit soon joined by the 
g = -90° stationary solution. Further increase in (3 drives both +90° and 
-90° stationary solutions toward T) 2 = 1, and purely circulating (and, for the 
most part, surviving) orbits remain. For a > 2 we have no non 90° stationary 
solutions but otherwise the evolution is similar. 

For high- a orbits the influence of the J n 1 s is slight; hence the a = 3 
picture for the U.S.S.R. J n 's is not presented, being only a slight variation 
of Fig. 4. The primary difference for the a = 1.7 case with the U.S.S.R. 

J n 's is the "evening" of the ±90° stationary solutions on the q 2 axis, 
implying that the critical inclinations are somewhat closer together for g = 

±90° than in the case of the Langley J n ' s . 

In Figs. 11 and 12 for T| 2 very close to unity one seems to pick up an 
additional stationary T} 2 solution at a given £3 value for |g| ^ 90° and 
for g = +90°. The existence of such double solutions could not be verified 
from the equi-energy contour program due to numerical resolution difficulties. 

It seems that it would be more advantageous to investigate these regimes via 
the low and zero e equations. Although the rapid falling- off in inclination 
at low e (Figs. 9 and 10 ) was verified for the g = ±90° stationary solutions 
from the small- e equations in A and B, more work is needed in this area. As 
pointed out earlier, probably more progress could be made for the restricted 
eccentricity problem through approximate integrations of the A and B equa- 
tions and thus the behavior very close to T] 2 = 1 clarified. 

Finally, it can be verified that the behavior for semi-major axis in the 
range 1 < a < 2 is similar to that presented for a = 1.7, with variations 
only as to the location of stationary points with (3 and T) 2 ; for a > 2 the 
results are similar to those presented for a = 3,0. 
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ABSTRACT 


In the uniformly rotating reference frame of the restricted 3-body 
problem (in which Earth and Moon occupy fixed positions on the abscissa), 
the equilateral libration points L and L are known to be points of 
equilibrium. A particle placed at rest at one of these points will 
remain at rest for all times. According to linear theory, for very: 

small disturbances from equilibrium the particle will tend to move 

I 

i along bounded trajectories in the immediate vicinity of these points. 

I 

I When the force field near L and L is not assumed to be linear, 

I 

| and in addition other perturbing effects are included, the particle's 

| motion might be excited sufficiently and lead to unstable divergent 
trajectories. 

! This report presents the results of an analytic study of the 

3-dimensional stability of motion of a particle near in a nonlinear 
Earth-Moon force field, upon which is superimposed a linear solar grav- 

| itational field distribution. In particular, the long period features 
of the particle's motion are studied, which stem from the excitation 

at or close to the particle's natural frequencies, and are introduced 

I 

by the presence of resonance terms in the internal (Earth and Moon) and 
external (solar) force fields. 

The results show that in the presence of the internal nonlinear- 
lties the stability of motion predicted by the linear theory is valid 

| for only a very restricted region of initial displacement and velocity 
disturbances. Disturbances outside this region would lead to divergence 
of the solution. The nonlinear coupling of the out-of-plane terms with 
the in-plane terms was found to be of minor importance and did not con- 
tribute to an appreciable transfer of energy from one mode of motion 
to the other. 

The inclusion of the external force terms was found to admit some 
equilibrium solutions of the variational equations. Of those, the one 
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stable equilibrium solution found was characterized by a coplanar el- 
HnHr l <» orbit srotr.d which had its major axis (cf magr.itcds 

roughly 120,000 ml) oriented at right angles to the line Joining Earth 
to L^. This orbit was traversed in a clockwise sense at mean angular 
rate equal to that of the Sun, as seen in the rotating coordinate frame, 
and very close to the particle’s faster coplanar natural frequency. The 
particle's motion thereby became synchronized with that of the Sun. 
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I. INTRODUCTION 

The subject of the Earth-Moon libration points has aroused in re- 
cent years the curiosity and interest of a great many researchers in 
the field of celestial mechanics and analytical dynamics . This renewed 

interest by modem day investigators in this classical problem has been 

(1 2 ) 

stimulated by the recent telescopic sightings by K. Kordylewski * of 
two faint cloud-like objects or shapes in the vicinity of the and 
Earth-Moon libration points. These findings have led to a great amount 
of speculation regarding the origin and stability of motion of such 
clouds, believed by many to be composed of minute dust particles. 

Although a number of more recent naked eye sightings from high 
flying aircraft have since been reported by a few investigators in this 
country, the issue of the existence or nonexistence of these libration 
dust clouds has not yet been resolved to everyone’s satisfaction by 
any of the current studies, and is still the subject of debate between 
proponents and detractors of this hypothesis. While the definitive 
answer to this question might not be obtained until concrete evidence 
and data will be gathered near these points from a space vehicle, the 
quest so far has not been all in vain. In the process a great many 
areas for further research of both a theoretical and a practical, mis- 
sion oriented, nature have been exposed and tackled, which will keep 
many researchers busy for quite a while. 

In the present dissertation we shall not attempt to shed new light 
on the question of the existence of dust clouds, but shall confine in- 
stead our attention to the study of the interesting underlying theoret- 
ical problem in nonlinear analytical dynamics of a particle. This par- 
ticle may be associated, if one desires to do so, with the center of 
mass of a hypothetical dust cloud. It should be pointed out however 
that the uncritical application of some of the results and conclusions 
of the present study to the dust cloud problem might lead to mislead- 
ing conclusions, since such important destabilizing effects as solar 
radiation pressure and particle collisions have not been considered 
here. 
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II. LIBRATION POINT GEWBTRY 

SomP of f"hp ppnmplri na 1 foahnrn c nf 1 ibr 3 t ICTl points 2TC briefly 
indicated below for the purpose of orientation. 

The five libration points (also known as Lagrangian points) of 
the classical restricted 3-body problem (i.e., Sun is neglected, and 
Earth and Moon revolve in circular orbits about their common center of 
mass) are indicated in Fig. 1. They are points of equilibrium in the 
coordinate frame XYZ, rotating around the Z axis with the mean angular 
velocity n of the Earth-Moon system, in the sense that no net acceler- 
ations are experienced by particles at rest at these points. 



Fig. 1: Libration points of the restricted 3-body problem. 

By means of linear small perturbation analysis the collinear 
points L^, L^, L^ were found to be unstable to small initial distur- 
bances, while the equilateral points and L^ were found to be points 
of stable equilibrium around which small amplitude conditionally pe- 
riodic (i.e., in this case doubly periodic but not necessarily simply 
periodic) motions resulted for small initial disturbances. 

The more realistic physical model used in the present analysis is 
shown in Fig. 2. The Sun, lunar orbital eccentricity e .055) and 
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inclination i of the Earth-Moon plane with the ecliptic (i 5°) are 
included. The Earth is assumed to move in a circular orbit around the 
Sun. 



Fig. 2: Three dimensional geometry of the 4-body problem. 
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III. BRIEF REVIEW OF PAST WORK ON THE SUBJECT 

Most of the basic work cm the restricted 3-body problem stems back 
to some of the classical studies in analytical dynamics of Lagrange, 
Jacobi, Poincare, etc. which are discussed in most of the standard text- 
books on Celestial Mechanics. Some of the main features and results 
are briefly summarized in the following sections. 

More recent analytic work on the 3- body problem concerned itself 
with such questions as the existence of periodic orbits both in the 
vicinity of the libration points, as well as periodic orbits which fill 
the whole Earth-Moon space and possibly loop a number of times around 
both primary bodies. 

Studies which included the solar force field are of a more recent 
vintage and are predominantly of a numerical nature, in that they 
tackle the problem by direct integration of the full set of differen- 
tial equations of motion for various periods of time t, and usually 

(3-5) 

for a very restricted set of initial conditions (i.e., zero par- 

ticle displacements and velocities, and collinear position of the major 
bodies in the order Earth-Moon-Sun) . The application of Hamiltonian 
techniques to the 2-dimensional libration point problem was suggested 
in an analytic study by Breakwell and Pringle.^ ^ These techniques 
are extended in the present thesis to the 3 -dimensional problem which 
also includes the effects of lunar orbital eccentricity. 

1. THE CLASSICAL RESTRICTED 3-BODY PROBLEM: PAST RESULTS 

AND THEIR LIMITATIONS 

Some of the basic results of the 3-body theory, as related to the 
libration points, and some of the questions left unanswered by the 
theory are mentioned in A and B, respectively. 

A. 1. The existence of the five Lagrangian equilibrium points 
shown in Fig. 1 was discovered. 

2, The stability of motion near these points was investi- 
gated by linearizing the equations of motion near these points. 
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3. For small deviations from equilibrium the coplanar homogen- 
eous set of equations (Eqs . (25) with p = v = m = 0) in the xy plane, 
which becomes uncoupled from the z equation, was shown to give rise 
to a doubly periodic solution with the eigenvalues a; .955 and 
.298 (these frequencies were nondimensionalized with respect to the 
mean Earth-Moon angular velocity n 2; .23 rad/day). The uncoupled, out 
of plane, linear equation in the z direction possesses a simple har- 
monic solution with eigenvalue = 1. (The reason for a period of 1 
lunar month in the z motion is easy to explain physically if we con- 
sider the limiting case of a vanishingly small lunar gravitational 
force field. In that case the small particle at follows a near 
circular planar 2-body orbit around the Earth at the lunar distance, 
which crosses the Earth-Moon plane twice for each complete particle 
revolution, thus leading to an orbital period of 1 lunar month, which 
is also the same as the period of the projected simple harmonic oscil- 
lator in the z direction.) 

4. A first, and only, integral constant of the motion was found 
to exist. This so-called Jacobi constant Cj corresponds to our scler- 
onomic (i.e., time independent) Hamiltonian H, and consists of the com- 
bination E - nh^ = constant = - Cj = H, where E is the particle's total 
energy (i.e., kinetic and potential) in a nonrotating baricenter cen- 
tered coordinate frame, h z is its angular momentum in the Z direction, 
and n is the mean angular velocity of the Earth -Moon axis. 

B. Same difficulties are encountered if one tries to extend the 
stability conclusions obtained from linear analysis to predict the be- 
havior of the complete nonlinear system. The main reasons are indi- 
cated below. 

1. The near coranensurabi 1 ity of the eigenvalues ^ 
leads to an internal near resonance with a detuning = ^ ^ “ 

,954593-3‘ .297912 = .06086. This causes poor convergence of the 

usual perturbation solutions by means of which one attempts to evalu- 
ate the effects of higher order terms, by substituting back the homog- 
eneous solutions into the nonlinear driving terms. Sane of them give 
rise to combination frequencies which are nearly resonant with the 
natural frequencies of the linear equations, and thus lead to small 
divisors in the next approximation. 
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2. The Hamiltonian H is not definite near or L,. (positive 
or negative). This sign indefiniteness has a bearing on the nature of 
fKo of af j j scent motions , oo ic briefly indicotod be lev. 

If we suitably recombine the terms in H of A (4) above we can c«ne 

1 2 

up with an equivalent relation for the Hamiltonian H = r v + V , 

2 .2 .2 2 1 erE 

where v = x + y + i and V represents an effective potential 

122 2 ett 

energy V e ^ = - j uj (x + y ) - p^/r^ " 7116 first terra in H 

thus corresponds to the kinetic energy, as measured in the rotating 
frame, while the last two terms in V e ^ represent the usual gravita- 
tional potential energy V. In this new form H can be interpreted as 
being in the nature of an energy integral of the motion. The nature 
of the stability near ^ can thus be deduced from the shape of the 
surfaces V e ^ ~ constant in that region. It turns out that near the 
equilateral points the planar part of V e ^ has the shape of a "poten- 
tial hill" rather than the "trough" which is required for stability. 

This circumstance raises a question concerning the applicability 
of the linear-theory stability analysis to the complete nonlinear sys- 
tem, i.e., whether the nonlinear system would exhibit the same kind of 
stability as predicted by the linear equations for given initial con- 
ditions. One may remark at this point, on the basis of work to be pre- 
sented later, that the answer is yes in a rather small neighborhood of 
L^. The nonlinear system will however exhibit instability for certain 
ranges of initial conditions. 

It is also appropriate to remark here that the stability of motion 
exhibited by the linear system near and L,. in the presence of a po- 
tential energy "hill" is brought about by the presence of gyroscopic 
terms in the linear equation (due to the Corioli's force 2(n x r) which 
arise in the rotating frame). When further nonlinear and external ef- 
fects are included, it is possible for additional energy to be trans- 
ferred into the system with the result that initially small oscilla- 
tions may grow in the course of time. 

It is interesting to mentidh that a Taylor series expansion of 

V near L, shows the equi potential curves to be extremely elongated 
eff 4 

ellipses of fineness ratio roughly 1:10 oriented at right angles to 
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the line from barycenter to L^. The potential field thus falls off 
quite slowly as we move in a direction perpendicular to the Earth - 
line. 

3. Another internal resonance occurs because of nonlinear 
coupling of the z and xy solutions, and the near coranensurability of 
the eigenvalues 2 : oj^, with the resulting detuning — *°454. 

This resonance leads again to poor convergence of perturbation type 
solutions . 

4. Although not actually a part of the classical 3-body 
problem, it might perhaps not be inappropriate to mention at this 
point also the presence of a third important resonance of an external 
nature caused by the Sun's perturbative action on a nominally circular 
lunar orbit, which is an important factor in the subsequent analysis. 

This indirect solar perturbation leads to a detuning = 2[u)^ - (1 - m) ] 
= 2 [.95459-. 92520] .05878. 

5. The additional complications of resonances introduced by 
the inclusion of lunar eccentricity terms will be taken up later. 

2. NOTCRICAL APPROACHES (SOLAR EFFECT INCLUDED) 

Straightforward integration of the complete set of differential 
equations, for zero initial conditions, gives rise to particle tra- 
jectories, a typical xy projection of which looks roughly like the 
one shown in Fig. 3 (taken from Ref. 4) . 

Figure 4 presents schematically another plot due to Feldt and 
Shulman^ of total particle displacement d with time t for an inte- 
gration time period of 5000 days. Initial conditions were the same 
as those in Fig. 3. 
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IV. SOME CONCLUSIONS REGARDING PRIOR STATUS OF THE PROBLEM 

The following conclusions sunmarlze some of the points which were 
raised in Sections III(A) and III(B): 

1. The past analytical efforts do not resolve in a satisfactory 
manner the question of boundedness of motion near the equilateral li- 
bration points of the Earth -Moon system, with or even without the in- 
clusion of the perturbative effect of the Sun. 

2. The numerical results available to date are rather limited 
in that they were generated only for restricted sets of initial con- 
ditions and initial Earth-Moon-Sun configurations. Consequently they 
do not shed much further light on the question of the possible exis- 
tence of domains of initial conditions and configurations which allow 
small amplitude, bounded motions to take place for long time periods. 

3. In view of the multiplicity of possible starting conditions 
and configurations, it is quite clear that a purely numerical search 
for such initial conditions would be both costly as well as of ques- 
tionable success, and thus not very attractive. 

4. The necessity and usefulness for further analytical ground- 
work on this problem seems to be clearly indicated. 

The above brief rundown will hopefully help to bring into better 
perspective the difficulties as well as the motivations underlying 
the present investigation. 
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V. THE LAGRANGIAN L FOR A PARTICLE NEAR L, 

A 

We shall desire the expression for the Lagrangian of a particle 
near the libration point, in the rotating xyz frame centered at L^, 
and having its xy plane coincide with the fundamental Earth-Moon orbi- 
■ tal plane. To this end it is convenient to start out with an inertial 

reference frame in which the positions of Earth, Hoon, Sun 

| and particle P are designated by the numbers 1, 2, 3, and 4, respec- 

tively, and by the position vectors R^ (i = 1,* , **A). The kinetic 
| energy and potential energy of all the masses are then 


T I “ ? I -iV^i 


i= 1 
A 




Gm.m, 

!_J_ 


. . . |R. - R.| 

i,j=l ' l j' 


--il 


i,J=l 


T ij 


(1) 


We switch first to an Earth centered rotating coordinate system ,Y^ ,Z^ 
► with the X g axis pointing in the direction of the instantaneous position 

i of the Moon (we neglect here the 3000 mi separation of barycenter from 

the center of the Earth) . For a particle of unit mass at point 4 we 
then have 


T 2 (*1 + r lA) * ( R 1 + T 1 a) 
*1 ^2 ^3 


V * - 


(3) 


1A t 2A 3A 




1A 


r lA + 2R 1 


14j r lA r 2A r 3A 7 1 1 


i 
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where 


i = 1,2,3 

The last term in L is independent of particle position and velocity 
and can be dropped. This follows from our assumption that the par- 
ticle does not affect the motion of the primary bodies. It is also 
convenient to remove from L the explicit presence of the Earth's in- 
ertial velocity R^. This can be done via Lagrange's equation 


d_ 

dt 



(4) 


and the Earth's equation of motion in inertial space 


" ^2 - ^3 — 

R 1 ~ 1 “ r 12 + 3 r 13 
r l2 r l3 


(5) 


Since ^ R^ , one can replace Eq. (5) by the equivalent 

relation 



^2 — — ^3 — 

“3 r 12 * r 14 + 3 r 13* r 14 

r 12 r 13 


( 6 ) 


After substituting Eq. (6) into (4) one can extract from it the 
expression for L shown in Eq, (7): 


L = 


1 - 
2 r 14* 




(7) 
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The last (solar) term in (7) can be further simplified if we re- 
place it with the solar potential energy gradient evaluated at the 

position of the Earth , as shown in Appendix A. This neglects terns 
3 -8 

of magnitude (r^/r^) 5? 1.5 x 10" , which is quite satisfactory in 

the present case, and leads to the expression 



Expression (8) is still not in the desired final form of a Taylor 
series expansion around L^. Before we carry out the expansion it is 
convenient to nondimens ionalize everything, as indicated in the next 
section. 
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VI. N0NDI1CNSI0NALIZATI0N AND EXPANSION AROUND L, 


l. «UWL»XflC.WaiUWAL,liATlUN 


The nondimensionalization is most conveniently carried out by 
choosing the reference frequency n and length D defined by 


< + 6 cos i > = mean angular veloc- 


ity of E-M axis X .23 rad/day 


D = < r 12 > = mean E-M distance ^ 2.4 x 10 mi 


It should be pointed out that the only physical quantity which 
can be measured with any degree of accuracy is n, so that the refer- 
ence length D is actually a computed, rather than a natural quantity, 
and is defined by Eq. (9). The averaging of r 12 in Eq. (10) must there- 
fore be interpreted in the light of the more basic definition (9) . 

(u^ denotes the mean angular velocity of an isolated Earth-Moon 
system (no solar perturbations present) , and ft and i are indicated in 
Fig. 2. 

Two basic dimensionless quantities which will appear often in our 
equations are 


t*l + “2 
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where 

n s = angular velocity of the Earth around the Sun 

From now on all lengths, velocities and times will be treated as di- 
mensionless quantities, but we shall retain their old symbols. 

2. EXPANSION AROUND L, 

Just as n was the basic quantity selected in the nondimens ionali- 
zation of the equations, we shall select m as the basic quantity, or 

yardstick, which defines order of magnitude. We shall denote by o(m) 

2 

a quantity of first order of smallness, o(m ) of second order, etc... 

The Lagrangian L of Eq. (8) can be written in terms of displace- 
ments and velocities measured in the centered xyz frame by writing 
the dimensionless vector relations 


(13) 


where 


r = xi + yi + zi 
x y z 

|r-_ | = 1 + p(t) = |r 12 | = instantaneous displace- 
ment of li brat ion point 
from the Earth 

F IL = I'll l(r r x + 4 r y) + « X ? 1L 

and for the total angular velocity to of the xyz frame in inertial space 

CD = — i + u(t) = i + u(t) (14) 

n z z 

p(t) and u(t) are the perturbations of the E-M distance, and angular 

velocity caused by solar and eccentricity effects, and are provided by 
Cl 8) 

classical lunar theory. * J 
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p(t) = - .0079 cos 2^ - .00093 - e cos 0 + e^ (1 - cos 20) 
15 

g wuo “ W V 1 

u(t) = £q sin i sin + i cos T^J i^ 

+ sin i cos 7^-1 sin i^ 

+ £.0202 cos 25 + 2e cos 0 + ^ em cos (2£ - 0) 

20 ] 


+ j cos 20 | i_ 


- u i + u i + u i 
xx y y z z 


(16) 


For additional details regarding the above expressions, and for an ex- 
planation of the various angular variables used, the reader is referred 
to Appendix B. The coordinates of the Sun in the ^ e >^ e »^ e ^ rame » Pre- 
sented in Eqs. (17) are also developed in this appendix. 

The Sun's position coordinates in the rotating frame are 


* 8 a r 13 cos g 

y s “ - r 13 sln ^ 

z ■ r. _ sin i sin (0 - v') 
8 13 


(17) 


We now stipulate that the following quantities will be treated 
as being of the first order of smallness: 


m » e *x,y,Z|P x> Py ,P z> «yp'(t) , »Jv T^T 


(18) 


The momenta P ,P ,P 
x y z 

relations 


conjugate 


to x,y,z * re introduced through the 
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— - = p -J£ 

ai x 2 


P = p + i 
y ay y 2 


p = — = p 
Z Z 

dz 


( 19 ) 


The terms linear in e(^ .055) in p(t) and u(t) are obviously only 
of o(m), and will have to be treated in a different fashion if we are 
to retain the definition of Eq. *18). This problem will arise when we 
include the eccentricity in the canonical transformations to slow variables. 

The use of a Taylor series to expand L and H around in terms 
of x,y,z, P , ... etc ... raises the question of how many terms of the 
series expansion have to be retained before we truncate it, i.e,, what 
order of nonlinear terms must be retained so as to take into account 
all the dominant perturbative effects. This question is readily an- 
swered by noting that the highest internal resonance is that resulting 
from the near equality — 3a>2 which indicates that nonlinear terms 
up to and including the fourth order must be retained in the Taylor 
expansions of L and H. 

When all the steps have been carried out and all the terms col- 
lected, as shown in Appendix C, one obtains for the Hamiltonian H, de- 
fined as usual by means of 


H = p r r - L 


(20) 


where 


T 


^.Py.^] =dX3) 


row matrix of momenta elements 


and 


x 

k 


(3 x 1) column matrix of position elements 
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the expression of Eq. (21) 

u _ x - j i x x x f 1 >T> w p ^ 4- — (y" - 

- - - 1 2 \‘x "y ' ~zJ ' V"x ~y ) 8 \ } 

. M (1 . 2li )xy - J P ( p x + JS P y ) + \ (p + \) G/3 p x ' P y ) 

+ \ ( Dy - J5 0^ - ( p + ? 0 (* + Jiy) 

- ” 2 p2- k + ^a)( X s X + V + *.*) - \ ( x + ^ y )j 

L 2r 13 J 

j(°) 

+ J (73 Vy + 0^ 

+ ji# ( xZy + y3 ) + Hr* ( 33xy2 - 7x3 - 12] “ 2 ) - ¥ yz2 } 3 

+ j ^ ( 32 ~ 2 ^ - (5x 3 y - 9xy 3 + 12xyz 2 ) + Hg * 4 + yj x 2 z 2 + y§ y 2 ^ 2 

- e 2 ■ hi - 1 4 ♦{- i ■ » 2 * *■’ - ^ ° • 

* •,(»', - »,) * ",(- p , - "j * ■*("'» - ".) 


■ 2 Brk-v) J -i(“ 2 ^ 2 *- 2 )]} 


+ 0(m 5 ,rn 6 ) etc . 


In the above expression we have split H / into a cubic part H^ t a 
quart ic part and a solar part H s> which in turn is composed of in- 
direct solar effects (via p and u) and a direct solar effect (via the 
2 

m term) . 

We shall concern ourselves in Section VII only with the motion 
resulting from the bracket [ which represents the linear and 


254 


THREE-DIMENSIONAL, NON-LINEAR RESONANCES 


quadratic part II ^ of H. These terms give rise to a system of forced 
linear differential equations which will be discussed below. 

The analysis of the effect of the terms in H 7 on the motion of 
the particle will be started in Section VIII. 
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VII. THE LINEAR DIFFERENTIAL EQUATIONS AND THE TRANSFORMATION 
TO NORMAL CANONICAL COORDINATES 


Hamilton’s equations can be written down in a very compact form 

by using the matrix notation. We define the (3 x 1) column matrix 

for r and P in a manner similar to those introduced for r and P 
r r 

in connection with Eq. (20), and introduce the additional (1 X 3) row 


matrix of partial derivatives of H 


(o) 


. [: 


H (°) 

"p 


«<o> 

“5x * 

ap • 




an 


(o) 


ap 


~§z J 

at<°>l 


( 22 ) 


The equations of motion then can be written in the form 


r 


lYf)] 



r T 


* * 


P 

o 


r 


pT 

- 


r 


(6x1) column matrix 


(23) 


where and HpT are the transpose of and respectively, 

r r r *r 

$ is the (6 x 6) matrix 


(;) 


(24) 


I is the (3 x 3) identity matrix, and 0 is the (3 X 3) null matrix. 
In component form, Eq. (23) becomes 

4 - = p x + y + {- \ p + 4 (p + »,)} 

f = = P y ' * f {■ ^ P ' H p + ■'*)} 

(cont. on next page) 
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i " “pf " P z + {? ^ u y - ^ u x)l 

s 

K - - H x 0) - p y - r x + H 0. - 2p)y + |(p + | 0 

s 

*y = - H y 0> = - P x + ! y + H < X - + |V3 (p + i 

*" 2 + 


P = - = - z + 


I u y + *x) + ^ j^T- (Vs + ^ y s 2 s) j 


(25) 


The terms in { contain the direct and indirect solar contributions. 

The homogeneous part of Eq. (25) Is obtained by setting p = v= m = o. 
The characteristic equation resulting from a trial solution e 1UJt is 


(l - U) 2 ) [m 4 - m + - T) 2 ] = 0 (26) 


where 


n = ^ (1 - 2n) = 1.26753 
The solutions to Eq. (26) are the eigenvalues 
uu 1 = ± .95459 

= ± .29791 (27) 

to- - ± 1*0 (corresponds to an 
uncoupled z motion) 
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The above u/s are the natural frequencies which were used in the 
discussion of the detunings in Section III. 

Let the 5 Glut luiis u£ the homogeneous sec o£ equations be ueuuteu 
by x,y,z*««and suitable particular integrals by x,y,z*** Thus the com- 
plete solutions are 

X = X + X 

y = y + y (28) 


For later use the 6 constants of integration which appear in the 
solutions (28) are best introduced by transforming first to a normal 
canonical set of coordinates Q and momenta P 


q T = [Qj. Q 2 . Q 3 ] r T - [V V P 3 ] 


(29) 


which also satisfy Hamilton's equations of motion and represent un- 
coupled motions in the form of independent simple harmonic oscillations 
having as frequencies the three eigenvalues 

The linear equations of transformation can be written in the form 

/ Q i 
<>2 
Q 3 
P 1 
P 2 
\ P 3 

where J is a (6 X 6) matrix whose columns consist of the eigenvectors 
corresponding to the eigenvalues ± u^, and which are normalized so as 
to satisfy Eq. (31) which is the necessary condition for a canonical 
transformation. 
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j $ 


J T = 


(31) 


The matrix presented in Eq. (32) satisfies this conditions and thus 
provides the proper coordinate transformation. 


2K^uj 


- 2K 2“2 


- K^a) 2 + i) - Kjdj^m 2 + i) 0 




0 


where 

Ki = {, n ^ + 2T ^.« 1} - 1/2 

K x = .62016 
= .72101 



0 0 0 


(32) 

i = 1,2 


The numerical values of the elements in J are 


0 

0 

0 

2.05374 

-5.66028 

o" 

-1.24032 

-1.44202 

0 

-.823463 

3.06768 

0 

0 

0 

0 

0 

0 

1 

-.687459 

-.0727629 

0 

.823463 

-3.06768 

0 

.750378 

.272262 

0 

.869732 

-5.23066 

0 

0 

0 

-1 

0 

0 

0 


(33) 
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In terms of Q and P the Hamiltonian (for the case p = y = m - o) 

becomes 


The solutions for the three harmonic oscillators which make up 

ovnro c c { rm f nr 


in Eq, 

. (34) 

car 

1 be g 


n/So7 


4 

Q, = 

- 

sin 

U). pC 

1 



1 K 1 




4 

q 2 


sin 

“*2 fl 2 




J. 

II 

<r> 

Cf 

a> 3 

sin 


p l = 


cos 



(35) 


P 2 ■ - cos ^2 

P3 * cos ^3^3 

where p^ = t + p^ P 2 “ t ~ Pj* = + P 3 » and ^1*^1 are the 6 re_ 

quired constants of Integration. 

Substitution of Eq. (35) and the J matrix (33) into Eq. (30) gives 
the homogeneous solutions for the coordinates 

x = 2.902 cos + 8.003 cos u^p^ 

y = 2.103 cos (u^ + 123.57°) 

(36) 

+ 4.793 cos (u^ + 154.82°) 

z * COB 
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The particle trajectories in the xy plane corresponding to each 
of the two coplanar normal modes are ellipses with major axes at right 
angles to the vector r^ and thickness ratios (minor axls/major axis) 
1:2 for and 1:5 for u> 2 as shown in Fig. 5. Motion proceeds in a 
clockwise direction. 



Fig. 5: Trajectories of normal modes. 


The complete unperturbed xy motion consists of a weighted super- 
position of these two normal modes, and is in general not periodic. 

The particular solutions S£,y, corresponding to the forcing func- 
tions contained in the { 3 brackets of Eq. (25) are most readily ob- 
tained from the coplanar equations 


x-2y--^-x-T 1 y = f - f + f 
4 1 J x y P 

x 

y + 2x - 7) x - | y = f y + f x + f p 


(37) 
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where f , f , f D , f n denote the direct and indirect solar forcing 
x y *x H y 

functions of the subscript variables given in equations (25) . For 

our purposes it is sufficient to obtain the particular solutions to 

2 

o(m ). After introducing Eqs. (16) into (37) we obtain the solutions 


x = 5c + 2 + 2 + x_ + 2,, + x 

o e em 'l c 


y = + + y„ m + y 2 + y_2 + 


(38) 


where 


x q = .01016 cos (2§ - 67. 2 U ) 
? = .00867 cos (2^ + 38.3°) 


Resulting from the 
indirect solar terms 


= 


.31 e cos (0 - 72.2 ) 
.227 e cos (0 + 50.2°) 


Sc = 11.1 em cos (2E - 0 - 75.2 ) 
em 

y - 7.86 em cos (2£ - 0 + 51.76°) 


1 .274 e cos (20 + 30.8 ) 


2 „ = 1.062 e cos (20 - 66 . 0 ) 


x m 2 = 

y 2 - 

m 

2 = .50 e 


1.697 m cos (2? - 127.7 ) 
2 


1.43 m cos (25 - 20.83 ) 

2 1 


Resulting from the 
direct solar terms 


57 = .2895 e 


Constant displacement 


No particular solution for z is retained since it is of o(m ) or 
higher, and would lead to terms of o(m 5 ) when substituted into H 7 . Oi 
this point we shall have something more to say in Section XIII. 

The corresponding solutions for P are readily obtained from the 
relations 
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P 

X 


* - y 


P = y + x 

y 

etc . 


( 39 ) 


It is interesting to note that if we substitute Eq. (35) into 
Eq. (34) we obtain the simple expression 


„(°) _ 

H = - 


+ Of, 


(40) 


The particular manner of introducing the polar set of integration 
constants into Eq. (35) follows from the canonical relationship 

which they bear the Hamiltonian H^°\ The quantities 0^, - 0^ , 63 and 
t > ^3 form, respectively, a canonical set of coordinates and con- 
jugate momenta with respect to of Eq. (40) . 

Thus 


K 


1 = H<°> 
°1 

-1 - H ( °> 


(o) 


“1 - - v 


= + *4 


0 or = const 


(o) 


(41) 


£ - 1 - H (o) 

3 “3 


“2 


H (o) 

V 

j 


The above results are in agreement with our stipulation that 0 
and 0 ^ be constants. 

Furthermore, the quantities and 0^ themselves form a canonical 
set with respect to an unperturbed Hamiltonian H = 0. 

The above canonical properties will be made use of when we analyze 
the perturbative effect of H*. 

The form of in Eq. (40) makes it very easy to verify the 

point made earlier in B(2) of Section III, regarding the sign indeter- 
minacy of H which is seen to depend, for small ar^* on the difference 
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• Although in the present case or^ and or^ individually are 
constants, it turns out that for the case s 0 the combination 
c^(t) - (^(t) remains a constant of the motion also when the pertur- 
bative effects of higher order internal nonlinearities are included 
(but external solar perturbations and lunar eccentricity are still 
neglected). Thus and may grow individually as long as their 
difference remains fixed, which indicates the possibility of an in- 
ternally generated instability near also for the classical re- 
stricted 3-body problem (for which we use the exact expression for H) . 

That H is a constant of the motion in the latter case (where 
H ^ H(t)) as stated in A(4) of Section III, is readily verified since 

= using Eq. (23) = - p T f + i T p = 0 (42) 

at or <jp 

The only existing integral of the motion, the Jacobi constant Cj. 
(see pp. 281 of Ref. 9 where it is denoted by unsubscripted C) , is 
equal to the negative of H 

H = - Cj ( 43 > 
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VIII. MODIFICATION OF THE LINEAR SOLUTION DUE TO h' 

The inclusion of the terms in the Hamiltonian H 7 , neglected until 
now in the previous solution, can be handled by a method equivalent to 
the customary variations of constants technique by requiring the orig- 
inal constants of integration at and B introduced in Eq. (35) to become 
functions of time, which then satisfy Hamilton's equations with Hamil- 
tonian H 7 . 

Inasmuch as we are not concerned in the present investigation with 
an exact or detailed determination of the particle's trajectory, but 
rather in the overall broad features of the motion, we shall desire to 
obtain only the slowly varying components of ct and 9 which will arise 
from the secular terms in H 7 , and those terms containing low combina- 
tion frequencies which arise from the near resonances. 

This can be accomplished by means of a suitable canonical trans- 
formation of coordinates from the polar canonical set at, 9 associated 
with H = 0 to a new slowly varying canonical set a* ,$* associated 
with a new slowly varying Hamiltonian K 7 . K 7 will contain only the 
lowest frequency terms which arise in H 7 as a result of the above 
transformation, all other faster terms having been suitably eliminated. 
The question as to which frequencies should be retained, and the cut- 
off point beyond which the periodic terms are dropped cannot be readily 
answered in general terms, but would depend on the particular problem 
considered, and also on the density of spacing of the resonance peaks 
in the lower end of the frequency spectrum. This point will be touched 
upon again later in connection with the specific form of the expres- 
sion for K 7 . 

Returning once more to the coordinate transformation mentioned 
earlier, it is reasonable to assume that for relatively small displace- 
ments x,y,z of the particle, the effect of H would be in the nature 
of a perturbation of the linearized solution found earlier. With this 
assumption in mind we may now consider a stationary contact transfor- 
mation 
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R # = R 4- K Q 

"i "i MM i 


(44) 


that may be introduced with the aid of a generating function G(0,a / ) 


C(B,<*') = Bo' + S(B.a') 


which satisfies the relations 


( 10 ) 


(45) 


B* = — = P + S , 
?**' a 


(46) 


The first term Po ,/ in G generates the identity transformation, 
while the function S(P,Q' / ) = + S£ denotes an additional suitably 

selected generating function which is introduced for the specific 

purpose of eliminating all the short period terms which occur in H*: 

3 4 

is selected to eliminate the terms of o(m ) and those of o(m ). 
Since S does not depend explicitly on time t we can write 

K'oW) = H(p / ,o / ,t) + h'CpW) (47) 

where H above is evaluated in terms of the new coordinates P* and new 
momenta ct * . 

When all the required steps of the transformation are carried out , 
as indicated in Appendix D, one arrives at the following relation for 
K' 


K = 


H 4 - \ [ H 3* S ll 


(48) 
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and are the long period terms resulting from the Taylor 
series expansions 


H. 

1 


). 


n-1 




(49) 


evaluated at x,y,z. 

[H^jS^] denotes the long period part of the Poisson bracket of 
with S^. results from the substitution of the homogeneous so- 
lutions x,y,z into H, , and consists of an internal part H, . and an 
_ 4- 4int 

external part which contains both the direct and indirect solar 

effects . 

The algebraic work needed to express K 7 in terms of a 1 and t 
is rather formidable, and is one of the major stumbling blocks in what 
would otherwise be a relatively straightforward solution. A few repre- 
sentative steps of the required manipulations are briefly demonstrated 
in Appendix E. If all the manipulations have been successfully carried 
out, one does eventually come up with an expression for K 7 which has 
the general form shown in Eq. (50). 


A01+A 2 


K ' - 4 b l + b 2 C 2^ 1+ X 1 ] + “2 b 3 + “3 b 4 + 4V, 

+ "2 3/2 [ b 6 C A 0 2+ x 3 + b 7 C CT -A0 1+ X 4 ] + “l' 1/2 “2[ b 8 C A0 1+ X 5 ] 

+ “i 1/2 «3[ b 9 C A0 1+ x 6 + b lO% 3 +A0 3 +X 7 ] + a i a 2 b ll + 4 b 12 

+ a' b 13 + or' 7 b 14 C o+x ^ + o-'or' 7 + b 16 C A0 2 +X 1() ] 


+ a' b 1? + a{or'[b lg + b 19 C 2A 13 +X 11 

+ « 2 ' 1/2 or 3 [ b ^ c .. + b„C 


21 A0 2 +X 12 22 4 13 _ CT+A 0 3+Xi3 


] + Qr 2 a 3 b 20 

] 


(50) 
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bj ( j = 1 > ***22) are known constants and stands for cos x. 

The detuning frequencies retained in Eq . (50) have the following mag- 

2A£ x 2(1 - m - u^) = -.05878 

2A0J -» 2(1 - ~ m 2 - u^) = .08242 

A0 9 = 0 - 3oj dt (1 - .0042 - 3uO * .10207 

4 4 (50a) 

<7 = ~ "* " ^^2 = -06086 

A^ u)-^ - 1 = -.04541 

Aj3 + A<Z> 3 -* - .04541 - .0042 = -.04961 

a - A0 X -» .06086 - .04121 = .01965 

and 


- a + A0 3 -» -.1105 

The terras containing A0 arise from the lunar eccentricity. 

As can be seen from Eqs . (50a) no terms with frequencies larger 
than .12 have been retained in the expression for K 7 . Although this 
choice of cut-off frequency appears at first sight rather arbitrary, 
it can be argued here that for higher frequencies the resultant de- 
tuning would not be narrow enough to introduce the very small divisors 
which usually lead to divergent solutions, and that consequently their 
omission should not materially affect the overall features of the re- 
sultant particle motion. 

The large number of frequencies which still are left in K 7 pose 
considerable difficulties in the way of a straightforward analytical 
treatment. To enable one to carry out nonetheless a reasonably mean- 
ingful analysis of the effects of internal resonances and of the solar 
perturbation, it was found necessary to reduce the number of admissible 
resonance peaks still further. This was accomplished by disregarding 
for the present time from further consideration all the terms which 
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arise from the lunar orbital eccentricity. While this step does tend 
to restrict the present analysis to encompass only circular lunar or- 
bits, it manages to reduce the number of detuning frequencies left 
down to 3. For this number of resonances an analysis can be carried 
out . 

Eccentricity terms could perhaps be reintroduced at a later time, 
possibly by means of an additional perturbation of the variational 
equations which result from the present circular orbit analysis. A 
possible shortcoming with such a scheme might be that it would prob- 
ably lead to a set of parametrically excited linear differential equa- 
tions which would not be readily solvable. 

Another somewhat different approach might be attempted, if we re- 
call that the elliptic 3-body problem (no solar perturbation present) 
admits as a solution an elliptic particle orbit around L^. This el- 
lipse is identical to the ellipse along which the moon appears to move 
relative to an observer moving with constant circular velocity along 
the moon's mean circular reference orbit, but rotated 60° with respect 
to it. Stated another way, the particle's motion is synchronized with 
that of the moon, but takes place 60° ahead of it. Variational equa- 
tions for these orbital elements due to the solar perturbation could 
then be set up and hopefully solved. 

The above are just two of the many other different approaches 
which might have to be explored in greater detail before the more gen- 
eral question of stability of motion could be satisfactorily resolved. 

In the present dissertation however, we shall hereafter confine 
our attention only to the case of zero lunar eccentricity. 
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IX. THE LONG PERIOD HAMILTONIAN FOR e = o AND 
THE ELIMINATION OF TIME t 

For the case of e = o the expression for k' shown in Eq. (50) is 
reduced to the simpler form given in Eq. (51) below. The numerical 
values of the coefficients b, and the phase shifts X, are determined 
after one performs all the tedious algebraic manipulations similar to 
those briefly demonstrated in Appendix E. There results 

K' - {.1266a' 2 - 6.0000^2 + 

- 29. 04a' 1 ^a^ 3 ^ 2 cos {.06086t + + 3^$' + 14.2°"J 

+ «'»'{. 09316 + .08608 cos 2 - .03934 sin 2^^ 

+ .7554a 2 'a' - .002231 a' 2 } - {.005394a' + .008208a' 

int 

+ .02685a' cos [.05878t + 2^8' + 29.4° + 2e ' - 2c] + .004193a 'j 

(51) 

where 

\ 3 = ^(t + e') - (t + p') = - .0454 it + u) 1 a 1 - b' 

The first bracket contains all the internal terms, while the sec- 
ond bracket includes all the external (solar) terms. The long period 
contributions to the coplanar (a^.a^) terms resulting from the periodic 
parts of the indirect p(t) and u(t) terms in H' were found to cancel 
exactly the indirect periodic terms generated by the linear forced re- 
sponse x q and y Q of Eq. (38). The external terms displayed in Eq. (51), 
which are left after the above cancellations, stem from the contribu- 
tion of the indirect constant component -.00093 in p, from the direct 
2 

(m ) terms in H, and from the forced responses x 2 and y 2 of the 
, . mm 

linear system. 
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Equation (51) shows that the dependence of K 1 on tine t cooes 
about through the presence of three distinct slowly varying trigono- 
metric terms with frequencies .06086, .09082 and .05878, all of which 
are of o(m) . Since the same trigonometric functions also depend on 
various combinations of the three angular variables B^(i = 1,2,3), 
the possibility suggests itself to eliminate the explicit presence of 
t by means of a suitable redefinition of the 0* so as to absorb the 

time dependent terms. Such a transformation would result in a new 
* 

Hamiltonian K which would not depend explicitly on t. 

This absorption of the time terms is accomplished by means of a 
coordinate transformation to a new canonical set of variables a* and 
0* as indicated below. 

We define B* via 

2 0* = ,05878t + 2 uj x b' + 29.4° - 2e + 2e ' 
or 

0* = ,02939t + u^b' + 14.7° - c + f ' (52) 

* 

The conjugate momentum ot ^ is obtained by the introduction of a 
generating function defined as 

J l * «t[* 02939t + + 14.7° - C + f 7 ] (53) 


so that 


bJ 


1 




(54) 


For the definition of 0 2 we use the trigonometric argument 


,06086t + u^e' + 3w 2 0 2 9 14.2° 
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and substitute for 8 ^ from Eq, (52). This leads to the expression 
[.0314*1- ■+■ 4 e f' ,5°J 4 0* 

* 

which suggests that 0 ^ be taken as 


0 2 = .03146t + 30^02 + C - e' - .5° (55) 

Use of a second generating function 

J 2 = ®2[* 03146t + 3 u 4 2 P 2 + f - - 5 °] (56) 


gives for the conjugate momentum q? 


* 2 


(57) 


The expressions for 0^ and can be obtained in a similar fashion 
with the aid of Combining first the cosine and sine terras 

.08608 cos 2i 13 - .03934 sin 2 Aj 3 = .09464 cos + 24.56°] 


we find that 


Bj “ ,074801t + cu 3 b' - ( + e' + 2.42° 


(58) 


and after introducing a generating function we obtain 


* 3 / 

3 UJ 3 3 


(59) 
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Letting J = and noting that J = J(0 # ,a*,t) we de- 
termine the transformed time independent Hamiltcmia K from the re- 
lation 


K* = K'<E>V) + ^jg* > C) 

ot 


(60) 


Substitution for a ,3 in terms of or ,0 in Eq. (51) and use of 

it 

Eq. (60) results in the desired expression for K : 


* f *2 * * *2 *1/2 *3/2 

K = |.1154<y 1 - 5.10^2 + 3.059a 2 - 23.97^ 1 ot 2 c 


* * 
B l +B 2 


vfe^-B*) 


* * 
*1*3 


* * 
* 2*3 


.0022 3 lor* 2 + .02939a* + .03146a* + .074801a*} 


int 


{.€ 


+ *| .004193a* - .007336a* - a*[j 


005149 + .02563C 2 * I 

P 1 ext 


*]} 


(61) 


where the notation - cos x has again been used for convenience. 
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X. ANALYSIS OF THE INTERNAL COPLANAR MOTION 
i. SiflFLlFlUATiUN OF THE HAMILTONIAN 

■k 

The analysis of the motion governed by the Hamiltonian K of 
Eq. (61) is made easier, and a greater amount of physical insight is 
gained, if we treat at first separately the internal terms contained 
in the first bracket. The modifications required by the presence of 
the second, external, bracket are then taken up later. 

Let us write for convenience 

* * * 

K = K. + K (62) 

1 e 


where 

* 

K. = all the internal terms 
i 

* 

K = all the external terms 
e 

and confine our attention in this and the next section to the Hamil- 
* 

tonian K . . 

l 

It would help matters appreciably if we could eliminate also for 
the time being the coupling which exists between the out-of-plane and 
coplanar terms. 

This elimination can be accomplished by a suitable choice of ini- 

* 

tial conditions which result in or- 0, provided we have reason to be- 

J * . 

lieve that a physical motion in which or^ does not depart much from its 
initial small value can in fact exist. 

The resultant coplanar type of motion can be maintained as long 
as the nonlinear coupling with the out-of-plane terms does not lead 
to an appreciable transfer of energy from one mode of motion to the 
other . 

In the next section, where we consider the out-of-plane motion, 
this situation will be shown to hold true. 

* 

On the basis of the foregoing we shall neglect here all the or- 
* * 
terms in K , which leaves us with the 2-dimensional Hamiltonian 

given by 
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K* 2 = . 1154<** 2 - 5 . 1cr * a * + 3 . 059»* 2 


+ .02939o* + .031460* 


,, *1/2 *3/2 

23 ' 97a l a 2 C 0*+B*= 


( 63 ) 


Since t is not explicitly present in the latter can also be 

treated as a constant of the motion. 


2. INVARIANCE OF THE DIFFERENCE q , 1 - AND BOUNDED MOTIONS 
* * * 

The presence of B. and 0 in K. ? occurs only through the combi - 
nation From this one readily sees that 


SK 


i2 


SK 


12 






which implies that 


(64) 


and after integration results in the additional coplanar integral of 
the motion 


«* ' a 2 = D 1 = * l D l^ < 65 > 

Unfortunately , this last integral does not provide any bounds on 

* * 

the magnitude of the coplanar displacements, inasmuch as or^ and 
are not prohibited by Eq. (65) from growing individually as long as 
their difference remains unchanged. 

On the other hand it is clear that the validity of the present 
fourth order theory would cease to hold long before the cr’s have grown 
to very large size, and that additional higher order terms in H would 
have to be included in the analysis. Equations (64) and (65) are of 
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great use in those cases when cr. and a do not grow without limit. 

1 2 * 

Let us consider now the question of boundaries of or . From 


&)’ ■ ( jf) ■ ” 

7 it *7 r it It? it it 

= 23,97 » ^2 " L K i2 ' + S.lffjOfj 


- 3.059a* 2 - ,02939a* - .03146a* 


f 


( 66 ) 


We now introduce the new variable 


« ■ 

and Eq. (65) into Eq. (66), which can then be written in the form 

^23. = f <0 - T1 (O 


where 


and 


( 68 ) 


il/2 


± [V(£ + 1)] for V l > 0 


[* 3 <* - 1>1 


, 1/2 


(69) 


for < 0 


- ^.2028 - - .0801 C 2 

^.2028 + - 0801 C 2 


+ constant > 0 


+ constant < 0 


(70) 
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The constants in Eq. (70) denote the value of T)(o) and are re- 
lated to the value of the Hamiltonian 1C 2 * 

The points at which the 71 curve intersects the + or - branch of 
the f curve 


■n - ± f (7i) 

correspond to points at which v = 0 and, by Eq. (65), also » = 0. 

^ 2 2 ^ 

Reality of the particle motions requires that f s* TT, 

The gradual changes of the motion of the physical particle in 
the xy space can be described by observing the motion of a representa- 
tive mathematical point along a given curve 71 in a plane in which f 
and n are plotted as functions of 

If the T! curve intersects both branches of the f curve or inter- 
sects the same branch at two different points, then a* and d* will 
have finite values at intermediate points on T\ t which tend toward zero 
as the representative point approaches the f curve. The sense of mo- 
tion of the point is reversed every time one of the branches of f is 
reached, so that the point continues to travel back and forth on a 
given 7| curve between its points of intersection with f. The turning 
or extremal values of the momenta a are thus fixed by the values whic 
£ assumes at the points of Intersection of 71 with ± f . 

The geometry in the f(£) and T](£) plane is shown schematically 
in Fig. 6. 

The curves in Figs. 6(a) and (b) represent bounded particle 
trajectories in the xy plane. The tangency points P 2» P 3 at 

H " V - * 

and (72) 

“l = "Z " 0 
* * 

are equilibrium points in the (or^,a 2 ) plane, and with the aid of 
Eq. (66) can be shown to correspond to coplanar periodic particle 
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orbits. Equation ( 66 ) requires that R* + R* = nir, which can also be 
written in the form 


tu, 3' + 3u> o 0' + ,06086t - nil + 14.2° = 0 

A A d- d. 

Reference to Eq. (51) shows that this condition eliminates the 
detuning term due to coplanar coupling and indicates conmensurability 
of the internally perturbed coplanar normal frequencies tii^ = ^ p ' 

and 0^ - cd^ + Periodicity of the coplanar particle orbits 

follows from here. 

The equilibrium is stable at point P 2 and unstable at point P 3 , 
where small disturbances may cause a displacement to a neighboring curve 
such as 7 ^ which causes divergence of the physical motion. 

Transition from stability to instability occurs at points where 

Tf = ± f' (73) 

When >0, f" does not change sign as can be seen in Fig, 6 (a), 

and from this follows that all the periodic particle orbits for which 
* * 

°1 > « 2 would of the unstable kind. For the case D 3 <0, f" does 

change sign at some value £ > 1 and we note accordingly the presence 

of one stable and one unstable equilibrium point along the +f branch 
in Fig. 6 (b) . 

3. THE PERIODIC MOTIONS 

When one solves the tangency Eq. (72) for the value of £ which 

corresponds to every choice of D- , one can obtain an or. for every or„ 

* * 1 12 
found. In the or^ versus a ^ plane this solution curve represents the 

so called "tangency locus" of equilibrium values of o* and o* which 

designate periodic particle orbits. This curve is presented in Fig. 7, 

where we have chosen as coordinates the quantities 10 /a, and loAy* 

* * ’ 1 12 

(^1 anc * ^*2 are *‘ n ^ act associated "action variables") . On this 
curve we have set the angular variables 
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*12 ' B l 


(74) 


This plot is seen to consist of two distinct branches which con- 
nect at the point (1.12,0). The left hand branch consists of a seg- 
ment of stable periodic orbits which is followed by a segment of un- 

* 

stable periodic orbits. On both segments A.« =0. The unstable branch 

iz * 

on the right hand side of (1.12,0) requires a A^ * R- 

Two more curves passing through (1.12,0) and consisting of left 
hand and right hand branches are also shown in this figure. The lower 
(solid) curve denotes the loci of intersection points P^ of T|^ with the 
second f branch. (For added clarification small inserts of the appro- 
priate geometrical situation described by Fig. 6 are also displayed 
here in connection with specific segments of the curves.) 

The dashed curve lying close to the P_ locus represents the inter- 

J * 

section of Tl^ with the £ axis. On this curve = The values 

of a£ 2 which allow stable motions to exist in each one of the domains 
1 _ XV which are separated by the above curves are indicated in the 
figure, and also by shaded regions in the small inserts from Fig. (6). 

The axis a * = 0 represents the locus of stable periodic particle 
orbits which are traversed with a mean angular frequency differ ing but 
slightly from . The stable periodic segments along which lojla 2 » 
10 marks those particle orbits which are traversed with a mean fre- 
quency close to ( 1 ^. 

Curves of = constant, intersecting all the above curves are 
also displayed for a few selected values of D^. 


4. FREQUENCIES OF THE PERIODIC MOTIONS 

In the present nonlinear treatment, except for the special periodic 
motions mentioned above which are described only by one single normal 
mode, all the other periodic particle orbits are generated by a super- 
position of both normal modes. Periodicity here is achieved as the 
result of an adjustment of the natural frequencies via the nonlinear 
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coupling which occurs between the two modes and which makes them exactly 
commensurable . The resultant frequency shifts Auj^ and Ad^ in the orig- 
inal undisturbed frequencies d^ and uv, , lead to normal modes with modi- 
fied commensurable (3:1) frequencies x* and w* 


Wf + = 3^ = 3(u^ + Alu,) 


(75) 


This point was also raised earlier in the discussion following Eq. (72). 
The orbital period T is determined by the slower mode 


m 2TT 
T = — 
/ 

*2 


(76) 


During this time T three cycles of the faster mode are completed. 

E. Evaluation of the Frequency Shifts for Periodicity 

For every point on the "periodic motion" curve of Fig. 7 there 

* * 

exists a unique set of equilibrium values and o^g. 

The shifts Ad)^ and can be estimated by writing 

<^(1 + p')t = 3(^(1 - B^t (77) 

and solving for and ^ TOm the relations 


— — - .02939 + ui^' 


i2 


= .03146 + 




3 ^2 


(78) 


evaluated at cy^g and From here one finds 
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• / * * *-l /2 *1/2 

Axi = Vl ' = .2308 c 1e - 5.to 2£ * 


^2 = 


x / _ i * , n - 0 * . * 1/2 * 1/2 

2 B 2 l-7a 1E - 2.039 q 2£ ± <* 1£ » 2E 


( 79 ) 


where the upper sign corresponds to ^ 2 = 0 and the lower to A^ 2 = it. 
* 

F. Variation of a s Near Equilibrium Points 

For small disturbances from the equilibrium points P 2 and P^ the 
time dependence of £ can be approximated by means of a Taylor series 
expansion of f and T t around the equilibrium points. 

Letting 


« e > £ r + \< (« - w 2 *; + ... 


and 


’E E 
2 _ 


1 ‘ \ + <« - «eW + r: <« - V 2 * + •• 


(80) 


and recalling that = fg, = fg, we can combine Eqs. (68) and (80) 
to obtain (after approximating 23.97 by 24 for convenience) 


24 


24|D^l dt ^ Q ~ 11 “ yU E U f E ' ^ ^ ‘ £] 


•E* 

(83) 


whence 


e t 24 M/|f E K f E - Tfi> £ W 

C - 4 E = e 

For a stable point such as P 2 in Fig. 6(b), we have f* < T]£. 

This makes the exponent in Eq. (84) imaginary of the form icu^t and 
indicates a slow oscillatory variation in o. For an unstable periodic 
point such as P^, f* > which leads to an exponential growth of a with 
time . 
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A few representative values of the period = 2ff/u^ are indi- 
cated alongside the stable periodic segment in Fig. 7. 

Tut: developments of c'ne present section can now be summarized by 
means of the following general conclusions: 

1. On the assumption that the out of plane terms do not couple 
strongly with the in-plane terms (which will be proven later) 
it is possible to reduce the problem to an essentially 2 -di- 
mensional one. 

2. Initial conditions which lie on an T| curve located to the 
left of the limiting curve of type Tl^ will lead to bounded 
motions of the particle in the xy plane. 

3. Depending on whether the J\ curve is tangent to the f curve 
at a point such as P^ or P 2> periodic particle motions of 
an unstable or a stable type, respectively, may exist. 

4. The periodic orbits generally result from a superposition 
of the two normal modes of vibration in which the nonlinear 
coupling has brought about commensurability of the basic 
frequencies by means of appropriate frequency shifts. For 
special Initial conditions, periodic particle motions con- 
sisting of only the faster normal mode may exist. 

5. In the neighborhood of stable equilibrium points of type P- , 

it it 

the momenta and perform low frequency bounded oscilla- 
tions in time. Near unstable equilibrium points of type P^, 
the o*'s will tend to grow exponentially with time, which 
results in a large growth of the particle’s motion in the 
physical xy plane. 
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XI. ANALYSIS OF THE INTERNAL OUT OF PLANE MOTION 


The analysis of the out of plane motion is rather simple and 

straightforward compared to the coplanar analysis of Section X. We 

★ * * 

shall investigate the coupling of Qr_ and af- in the region where or« - 0 , 

•* j i i 

by neglecting the terms in Eq. (61). 

The reason for this particular decision is the result of hind- 
sight, based on a prior preliminary study of the external effects on the 

coplanar motion which disclosed the presence of a stable equilibrium 
* * 

point / 0, <*2 = 0* for Sun perturbed problem. This will be dis- 
cussed in more detail in Section XII. 

k 

Let us denote by F the internal terms left in the Hamiltonian 
* * 

K of Eq. (61) when all c ? 2 terms are dropped. We have then 


* 

F 


.1154<* 


*2 


! + .090350^0^2 (g* p*) + .0889 3af^ar 3 


- ,002231c** 2 + ,074801<** + ,02939c** 


(85) 


* * * 

Let - 0 3 . 

From Hamilton's equations we then obtain 


or. - 2 • .09035of 1 O' 1 S„ * 

1 1 3 Zfl 13 

®3 = - 2-.09035or*or*S 2 ^^ 


( 86 ) 


This leads to the new integral of motion 


* 



= D 


2 


(87) 


with D 2 > 0. 


.* 


As we did before for or 0 , we can now write for or. 


.*2 
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2 

(i*) = A.(.09035) 2 a* 2 <y* 2 - 4| _ F* - ,1154a* 2 - .08893a*a* 


.002? Tlrv* 2 _ mo™-* 

”3 - •‘"H 


*T 

“3j 


(88) 


We introduce the auxilliary variable £ 


* 3 = D 2 


(89) 


and end up again with the equation 

\2 


(n&J ■ < 2 -* 


(90) 


where this time 


f - ± c,a - «,) 


(91) 


and 


f'(o) = ± 1 


(92f 


- 5 - - 1.277(1 - 4 ) 2 - .98434,(1 - 4 ,) + .02469E 2 

.09035D, J 3 3 ’3 


.32534 .50259 

Do “ D_ ^3 


(93) 


At the origin, the first and second 71 derivatives are 


Tl , (o) = 1.5703 - *50259 > ^ depending on 


(94) 
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H"(o) = Ti'($ 3 ) = -.5366 < 0 (95) 

T,'(o) <0 if D 2 < .3201 (96) 


and 


T.'(o) - - 1 if D 2 ^ .1955 (97) 

The magnitude of the slope Tl' will determine the time history of 
^ In particular, if t/ <r - 1 then A^ 3 will exhibit a circulatory 
behavior, while a value of -1 ^ 71 7 < 0 would lead to a librational 
behavior. 

An upper bound on V can be established by making a reasonable 
estimate for an upper value of D 2 * Such an estimate can be furnished 
from some of the mathematical and physical considerations which underlie 
the present analysis. 

From a mathematical standpoint it is clear that in the binominal 
expansions and truncations used to obtain the expression for the Ham- 
iltonian H(x,y,z,t) of Eq. (21) it was assumed that x,y,z were small 
compared to unity. 

From a physical point of view it is not clear that relatively 
large displacements away from the Moon would necessarily invalidate 
the conclusions of the present analysis, but the large accelerations 
resulting from large displacements towards the Moon or Earth could not 
be tolerated. 

If we assume that the displacements should be limited to values 
x,y,z < .5 (say) then for the excitation mode ^ we -an obtain from 
Eqs. (36) 


/®f < 4 = - 25 

i .e . , < .0625 
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ami also 


a* 


< .0625 


so that for these limits 


D 2 -- .1250 <* .1955 

The slopes of all ti curves are thus steeper than f 

which follows that every V. curve will intersect both ± : 

* 

giving rise to a circulatory motion in A. ~ as indicated 



'(£ 3 > fran 
E branche s , 
in Fig. 8. 


Fig. 8: Geometry in (*.£3) and apace. 
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Since }Tj , |>|f / | no equilibrium points with a ^ ^ 0 can exist, 
and consequently no periodic orbits in xyz space result from the non- 
linear coupling of modes 1 and 3. 

The actual slope of any T curve would depend of course on the 
value chosen for , subject to the limits mentioned earlier. 

We may choose for example a representative value of at = .006 

■jUt ^ 1 

(say) and assume to be of the same magnitude (this a. is very close 

* i 

to the actual coplanar equilibrium value of a^ in the externally per- 
turbed case discussed in Section XII) . Then we have 

D 2 - 2a* = .012 (98) 

This results in a slope 

o' - 1.57 - = -40.3 = tan _1 e 

or (99) 

e ^ 90° 

In other words the 71 curve intersects the axis nearly vertically, 

from which one concludes that constant; thus, there is hardly any 

, J * * 
energy interchange taking place between a^ and a^, which shows that 

the out of plane coupling is not very important in this problem, and 

that the motion is dominated by the coplanar coupling. 

That the out of plane coupling does not introduce any instabili- 
* * * 
ties when « and is close to its equilibrium value ^ .006 

could also have been deduced directly from the expression for F in 

Eq. (85). For very small a- it is sufficient to consider only the 
* J * 
terms linear in a-, and to evaluate the coefficients at a, .006. 

J * 1 
The resultant Mathieu type Hamiltonian F indicates a parametrically 

excited motion. Such Hamiltonians are discussed more fully in Appendix F, 

(in connection with the solar effects on the coplanar motion examined in 

* * 

Section XII) but under the assumption that the values of and are 
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to remain very small (i.e., coplanar particle motions for very small 
perturbations from rest at L^) . 

If one applies the results of Appendix F to the present situation. 

* 

and notes that the coefficient of Q' 0 C 0/q * is smaller than that of 

one readily concludes that the parametric resonance present in the 
out of plane motion does not lead to instability. 
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XII. ANALYSIS OF EXTERNAL EFFECTS 


1. DETERMINATION OF EQUILIBRIUM POINTS 

For a complete analysis of the motion in the presence of the ex- 
ternal solar effects, one must retain the complete expression for K* 
given in Eq. (61). 

From the discussion of Section XI it was seen that the , 0 * in- 
ternal coupling did not lead to any measurable transfer of energy from 
the out -of -plane mode to the coplanar mode of motion, while from Sec- 
tion X we have established the existence of an appreciable coplanar 
coupling effect. 

The major long term solar effect causes mainly an excitation of 
* * 

the or^ mode. The or ^ mode does not experience any external excitation 
to the order of magnitude of the terms retained. This latter state- 
ment follows from the developments presented in Section XIII. 

If a stable motion in the presence of the Sun is possible in which 
* * * 

0 * 1 , 0^2 a nd or-j remain small, it would suffice to retain only linear terms 
in K* in order to determine long term effects. To linear terms we have 
the simpler Hamiltonian 


.02425^ + .02412<* 2 + .07899^ - .02563^0^* 


( 100 ) 


which is of the Mathieu type, as indicated in Appendix F, and leads to 

* 

parametric resonance in the motion. 

Since .02563 > .02425, the stability criteria of Appendix F indi- 
cate that the motion falls into the unstable region of the Mathieu 

plane, and that therefore to linear terms no motion can exist for which 
* 

remains very small. 

From a physical point of view this means that the libration point 

L, is not stable with respect to small perturbations, when the solar 
4 * 
force field is included, and that the higher order terms in must be 

retained in any analysis. 

The lack of stability exhibited by the linearized Hamiltonian does 
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not preclude the existence of equilibrium points in the o' space for 

the complete Hamiltonian. In view of the negligible effect of on 

the cepler.er motion, it uf iiiLeiesc co Look for equilibrium points 

k k k 

for ofj = 0. Such points in the plane are determined by look- 

ing for solutions to Hamilton's equations of the form = 0. 

Once such points are located, it is then necessary to investigate 
the type of equilibrium which exists there, and to identify the stable 
ones . 

This search is more easily carried out if one switches over to a 

k k 

set of normal canonical coordinates (Q,P ) defined by 


l<\ 

q' 


1 * 2 / 


0 \ /v ' 




C fi* 

B 1 


WJ 


( 101 ) 


After setting a = 0, the two dimensional part of K , which we 

, * J 
denote here by ^ » becomes 

>; ■ ^ ('V * *?f - ¥ * <?) * ^ if? * <* 

- (« • «M)« ! * ■>?) * ™ If? * < 2 ) 

• ^ I’? * 1?) ■ {’? - •>?) 00 !) 

k k 

The equilibrium points (Q e »P e ) are obtained from the solution of 
the equations 


*2 

2 
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K! 

= $ 

fK* p *r\ 

.* 

o 

H 

-;c 

O 

V 


= 0 


( 103 ) 


From Eq. (103) we have 

0 = .1154P*(pf + Q* 2 ) - p*(p*2 + q *2) 




23.97 *( *2 *2\ * 

4 ^2\ z + Q 2 ) ' ■001379P 1 


- ^ir~ ( 3P 1 P 2 2 + P 1 Q 2 2 ■ 2P 2 Q 1 Q 2:) + - 02412p 2 

0 = ,U54Q*(p* 2 + Q ; 2 ) - 5^1 Q *( P * 2 + Q* 2 ) 

+ Q*(p* + Q*) + -04988Q* 

0 * - ¥ ¥ P 1 2 + ¥) + 3.059Q*(p* 2 + Q* 2 ) 

* *2\ * 
Q 1 Q 2 ) + “ 02412Q 2 


(104a) 


2 Q* 


K 2Q * 


(104b) 


(104c) 


23 


.97 / * * * * *2 

r 1 K P 1 P 2 - W - 3Q - 


(104d) 


Equations (104c) and (104d) are identically satisfied if we chose 
★ * 

Q le = Q 2e = °* For convenience we shall therefore restrict our search 
to those equilibrium points for which 


* 



Q 2e = ° 


(105) 


293 


THREE-DIMENSIONAL, NON-LINEAR RESONANCES 


For the above Q 's Eqs. (104a) and (104b) give 

. 1154P* 3 - 2.55P*P* 2 - 5.810P* 3 - .001379P* = 0 (106a) 

- 2.55P* 2 P* + 3.059P2 3 - 17.43P*P* 2 + ,02412P 2 « 0 (106b) 

One equilibrium point can be obtained by setting P 2e ^“ ^ (which 
automatically satisfies Eq. (106b) and then solving for P le from the 
relation 


,1154P 1 - .001379 = 0 


(107) 


or 


which corresponds to 


.1093 


v* = .005975 
ie 


(108) 


The above value of a* is the one which was used in earlier sec- 
tions when representative numerical values were used. 

The first equilibrium point, which we denote by Ej, is thus speci- 
fied by the coordinates 

V < ■ ■ 13 - P 2 “ P 3 = 0 “t " - 005975 

P* = .1093 »2 = 0 (1 ° 9) 

* 

“3 = ° 

Another equilibrium point can be found for which P 2 t 0, all other 
homogeneous coordinates remaining the same as for point Ej. The values 
of P* and P* result from the solution of the algebraic equations (106a) 
and (106b), after P^ is factored out from the latter. The coordinates 
of the second equilibrium point E n were found to be 
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* * * * 

E ir Qi = Q 2 -Q 3 = p 3 = 0 


P = .1106 


F 2 = -.003675 


.006116 


(110) 


<>2 ~ 6.753 x 


,„-6 


The two points and E^ were the only ones readily found for 

the present simplified conditions. A machine search of the complete 

set of Eqs. (103) might reveal the existence of additional roots. The 

periodic elliptic particle motion of mode close to uu^ corresponding 

to conditions at E^ has a semimajor axis of about 60,000 mi and a 

semi-minor axis of half this value. These values were determined by 
_2 —2-1/2 _ — 
computing r^^ = [x + y J taax where the modes of x and y of Eq. (36) 

were used, and the maximum determined with respect to It can 

be shown that this requires that 8.42252^^ + ^•^ 2 -^ 2 ^ 18^+247 = 0 

and results in a value tu. BC 15.62°. The dimensionless expression 

11 . 1/2 * 

for r then becomes r > 3.2a., , and at a, — .006 amounts to 

max max 1 1 

roughly 3.2 J. 955:006 x 2.4 x 10 5 = 58,128 60,000 in round numbers. 

In a similar manner one finds for the maximum dimensionless dis- 
placement in mode the semimajor axis r^x ^ 9.1^*2~and in miles 
r max = 9 ■ 1 / 3uu 2^2 _ * 2,4 * 1()5 = 9 *V.8937A/2 • 2.4 x 10 5 miles. 

It is of interest to observe that this result indicates the par- 
ticles mean motion is synchronized with that of the Sun such that 
their angular positions coincide closely whenever the particle crosses 
one of the axes of the ellipse. 

* * . , 

We recall that at equilibrium - 0 and hence - MT with 

n = 0,1***. For n = 0, Eq. (53) gives 


P = 0 = .02939t + b[ + 14.7 - € + e' 


and from here 


- V 


,02939t - 14.7 + € - f' 
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When the particle crosses the major axis we had = 15.62, 

and from the comnensurabil ity of angular velocities at E^, - .02939) 

= 1 - tn. Substitution above gives 

15.62 + 14.7 = 30.32° = (1 - m)t + € - €* £ 

as defined by Eq. (B-9). Equation (17) then shows the Sun to be lo- 
cated 30.32° below the x axis, and therefore closely aligned with the 
major axis of the particle's orbit. 

2. STABILITY OF THE EQUILIBRIUM POINTS 

The stability of the slow variations around the above periodic 

equilibrium motions in the xy plane can be determined by setting up 

the expression for the variation 6K which results from taking small 

displacements 6Q* and SP around the equilibrium values Q ie = 0 and 

P*. Clearly, since E_ and E are equilibrium points, the coefficients 
6 I ^ 

of the linear terms in §P must vanish, and on then obtains in three 
dimensions 

SK = ,02885P* 2 [66P* 2 + 26Q* 2 + •••"]- 1 - 275 [ P le( 8P 2 2 + 6Q 2 2 ) 

* * * * * 2 / *2 ,„* 2 \ . “1 
+ 4P le P 2e 8P l 8P 2 + P 2e( 8P l + 8Q 1 ) + "’} 

+ ,7648P* 2 [66P* 2 h 2^Q^ 2 + •••] - .00039^P* 2 (f,P* 2 + SQj 2 ) 

+ 4P le P 3e SP l 6P 3 + P 3e( SP l 2 + 

+ .04958[p* 2 6P* 2 + P* 2 6P* 2 + 2P* e P^(26P*6P* + 6Q*4Q 3 )] 

+ ,2113[p* 2 (6P* 2 + 6Q* 2 ) + ^2e P 3e 6P 2 8P 3 

+ P 3^( 6P 2 2 + 8Q 2 2 )] ' •0° 0 5578[p^(66P* 2 + 26Q* 2 )] 

+ .0395[6P 3 2 + 6Q* 2 ] - 5 .810^P* c P* e (36P 2 2 + 5Q 2 2 ) 

(con ' t on next page) 
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+ l?2e( 36P l 6P 2 ' 6Q 1 6Q 2) + - 012126 [«P* 2 + 6Q* 2 ] 

+ ,012062 j^fiP^ 2 + 5Q* 2 ] - ,012815[6P* 2 - 6Q* 2 ] (111) 


Applying expression (111) to point E^. results in 

MC* = .0013806P* 2 + .025636Q* 2 - .0031746P* 2 - .0031746Q* 2 

+ .040086P* 2 + .039496Q* 2 (112) 


Since for every value of i = 1,2,3 the coefficients of 6P. have 

* 2 , * 1 

the same sign as the coefficients of 6Q. (i.e., 6K is either positive 

1 * * 

or negative definite irrespective of the signs of 6P or 6Q ) we can 
conclude that point E^ is stable for small disturbances in all princi- 
pal directions. The period of the slow variations in SP^^Q-^ is approx- 
imately 83 months. 

* 

It is more convenient to retain only coplanar terms in 6K for 
the determination of stability at E^. We then obtain the expression 

6K* = .00141 16P* 2 + .025636Q* 2 + .0018386P*6P* + .0036526P* 2 

+ 7.847 x 10" 5 6Q*6Q 2 - .0011506Q 2 2 (113) 


If we now assume P. and Q- to remain unchanged while we intro- 
* 1 * 1 

duce variations 6P^ and 6Q^ we have 

6K* = .0014116P* 2 + .025636Q* 2 (114) 


where 6Q 2 = 6P 2 =0. 

Thus 6K is positive definite for variations in the first set of 
* * 

coordinates and hence 6Q 1 and 6P, remain bounded. 

1 1 * * * 
Repeating the same steps for 6P 2 and 6^ while keeping P^ and 

fixed gives 
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6K* = .0036526P* 2 - .OOmOfiQ^ 2 <U5) 

where 5P, = 6Q. = 0. 

1 * L * * 
Since 6K is not definite for arbitrary choices of 6P_ and 6Q2 

* * 1 

we conclude that point is not stable in 6P2 and 6Q2* a nd hence is 
an unstable equilibrium point. The equilibrium for variations 6P^ 
and 6Q^ was found to be stable, which is in agreement with the find- 
ings of the last section. 

The above conclusion could have been reached also more rigorously 
in a somewhat lengthier fashion by writing down the complete system 

• ^ .St 

of first order linear differential equations for 6Q and 6P obtained 
from 6K* of Eq, (113), and examining the roots of the appropriate char- 
acteristic equation. We would find that 


l<\ 


1 0 

0 

.002822 

.001838 

1 * 
hi\ 

6Q* 


[ 

0 

0 

.001838 

.007304 

* 

6Q 2 

.* 

6P 1 

= 

-.05126 

-7.85* 10 -5 

0 

0 

* 

8P i 

U) 


|-7.85-10" 5 

-.0023 

0 

° 

K/ 


(116) 
s t 

A trial solution of the form e would lead to the characteristic 
equation 


S 4 + 1.282 * 10~ 4 S 2 - 2 .031 *10"® = 0 (117) 

which has one positive root because of the negative constant term. 
Equation (117) thus bears out the conclusions reached from Eq. (115). 

A simple geometrical description of the stable and unstable re- 
gions in the 6 dimensional P*,Q* space is of course not feasible. On 
the other hand it is possible to take advantage of the fact that the 
stable point is noticed to lie very close to the unstable point E n 
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It is thus of particular interest to determine the extent of the stable 

* * * 
region around E^, by expanding K up to cubic powers in 6P and 6Q 

around E . 

The intersection of surfaces of constant K with the (P 2 »Q 9 ) 
plane, for a value of = .11, is shown in Fig. (9). The dashed 
curve shows the separatrix which passes through and separates the 
stable from the unstable regions. 

In the physical xy plane, a point in the stable region gives rise 
to slow variations of the elements of the periodic particle orbit cor- 
responding to Ej. A point in the unstable region of the P* ane 

would lead to large particle departures from the equilibrium orbit, 
and thus indicate a possible divergence. 
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9: Stability Regions in the (P*,Q*) plane Near ^ 

Coplanar Equilibrium Points. 
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XIII. EVALUATION OF THE EFFECT OF THE RESONANCE CAUSED 
BY THE FORCED SOLUTION z 

We had alluded on page 23 to the fact that no forced solutions 

3 

in 2 , i.e. s z. had been retained since they are of o(m ) and would 
thus give rise to terms of o(m^) or higher in H when one went on to 
derive the long period contributions. 

A closer second look at the external z terms in disclosed 

the existence of a very closely tuned forcing term in the linearized 
out of plane z motion which could introduce perhaps small divisors 
in the solution for z and thus depress the order of magnitude of that 
solution. This would introduce another important long period term 
into the Hamiltonian K. The resonance in question arises for example 
from a term such as 

x z = -r? 0 « t sin i sin [1.0040212t + f] 
s s 13 2 o 

which would lead to a detuning of magnitude 

1.0040212 - 1 * .0040212 (118) 

This value would introduce a much slower term in K than any of 
the terms previously retained, and might conceivably require a redefi- 
nition of the angular variable introduced earlier. 

The developments indicated briefly below disclosed that the z 
resonance terms cancel each other exactly, and consequently do not 
contribute a term slower than the one already considered. No further 
modifications to the analysis of the out-of-plane motion of Section XI 
were thus required. The steps leading to the above mentioned cancel- 
lation were nevertheless found interesting enough to justify their 
inclusion here. 

The z portion of the external part of ^ of Eq. (21) was 
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For a coordinate system with its x axis pointing at the instan- 
taneous position of the Moon, the angular velocity components and 

Ti nr<» oiVAn hv 

-y " 


v = i cos 71 + Q sin i sin Tl 
x 

Vy = G sin i cos Tl - i sin n 


( 120 ) 


These are the same as Eqs. (B-3) except that T| o has now been re- 
placed by T = gnt + C - G and g = 1.0040212. 

The angular velocities G and i can be expressed in terms of 71, i 
and the solar acceleration component W normal to the Earth-Moon plane 
at the Moon's position, by means of the variational equations on page 
404 of Ref. 9, in which a corresponds to (r^) here 


. r 12 sln Tl 

n - w 

na sin i 


r cos 71 
1 = ■ • • -a W 


( 121 ) 


By our nondimensionalization convention na = 1, so that 

u = -12. w^sin T] cos 71 - cos 71 sin Tl"] - 0 (122) 


and the angular velocity id has thus no component in the y direction. 
One can also write for v 

x 


wTsin 2 71 + cos 2 T)J = (1 + p)W 
Dna Z L 


(123) 
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A suitable expression for W can be obtained by differentiation 
from the potential energy V g near the Moon. If we let x,y,z denote 
small displacements from the instantaneous position of the Moon, we have 


/ 






* 

- 

SV s 


a 

m 2 

3 h . F f 

1 2 

dz 

x,y,z =0 

dz j 


2 \13 14/ 

.13 

" 2 r 14 


dz 


13 


(x g (l + p + x) + y g y + z g z) 
j ((1 + P + x ) 2 + y 2 + A 


x,y,z =0 

= 3 m 2 -±J-+ o(m 5 ) = 3m 2 sin i cos £ sin (D - v 7 ) 
r 13 


( 124 ) 


To sufficient accuracy then 

u s: W = 3m 2 sin i cos £ sin (D - v 7 ) 

X O 

(o) 


( 125 ) 


To check if H(z) would in fact lead to the presence of small di- 
visors in the solution for z it suffices to check if H(z) contains 
slowly varying terms of frequency ,0040212 when we replace in it z 
and P by the homogeneous solutions z and P . 


H(z) = — m sin i^ ./Zor^ COB ^3 I s * 0 (1 ,00402 12 1 + €) 

+ J5 cos (1.0040212t + *} + ^ [cos ^ sin 


( 126 ) 
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Since 


ft - v * - £ — -1.CG4G2o.2l - « (127) 

we may retain in only the dominant resonance term 

u x “ ‘ \ ™ 8tn i 0 sin d- 0040212 * + 0 (128) 

(o) 

When Eq. (128) is substituted into H(z) of Eq. (126) and all the 
terms combined it is found that all the long period terms cancel each 
other exactly and only fast terms remain. From this one can conclude 
that the forcing function of the linearized z equation does not con- 
tain a resonance term which is close enough to introduce small divisors 

into the forced response z and thereby lower its order of magnitude 
3 2 

from o(m ) to o(m ) or less. 

Based on the foregoing we can conclude that the neglect of the 
contribution of z to the long period terms (SH/dz)z was consistent 

4 

with our convention of neglecting terms of order higher than o(m ) . 

This analysis shows that although the Sun has an appreciable long 
term effect on the changes in inclination of the lunar orbital plane, 
it has the same effect also on the orbital plane of the llbrating 
particle, with the net result that any relative long term out-of-plane 
responses vanish. Short period, fast, relative terms do not cancel 
out though . 
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XIV. SUMMARY AND CONCLUSIONS 

In the present dissertation, the 3 -dimensional stability of the 
motion of a particle near the equilateral libration points of the Earth- 
Moon system, in the presence of the Sun, has been investigated. 

Because the inclusion of lunar eccentricity would have introduced 
into the problem a larger number of internal and external resonances 
than could have been handled by the present method of approach, it was 
found necessary to restrict the stability analysis to a lunar orbit 
perturbed by the Sun but without eccentricity. 

Four major conclusions emerge from the present study. First, 
small coplanar motions near L^ or will grow large because of para- 
metric excitation by the Sun, as a result of nonlinear resonance. In 
fact, the growth of the energy in the faster normal mode of the linear- 
ized theory is found to be governed by a Mathieu equation. 

Second, the out-of-plane motion is not seriously excited by the 
Sun, and has a negligible effect on the coplanar motion, which is the 
dominant factor as far as stability is concerned. 

Third, a stable periodic coplanar orbit can exist in the presence 
of the Sun. It consists of a clockwise motion along the 1:2 ellipse 
corresponding to the first (or faster) normal mode, and has a semlmajor 
axis of approximately 60,000 mi. The external nonlinear excitation 
causes the mean angular motion of the particle to become synchronized 
with that of the Sun. Thus to an observer located at and looking 
continuously in the direction of the Sun, the particle would appear to 
move back and forth across his line of sight in the manner of a simple 
harmonic oscillator. The times of crossing of the line of sight coin- 
cide closely with the times at which the line of sight is aligned with 
the major or minor axis of the ellipse. 

Fourth, the presence of the internal resonant excitation, result- 
ing from the near comnensurability (3:1) of the two coplanar normal 
models makes the stability somewhat delicate. As a consequence, the 
semimajor axis of the second mode is limited to magnitudes less than 
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approximately 2400 mi. For larger values the motion becomes unstable 
and may result in very large displacements which would exceed the range 
ul applicability ul the present theory. 
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Appendix A 

SOLAR GRAVITATIONAL GRADIENT CONTRIBUTION 


Consider the term 


r 34 

J4 r 13 


of Eq. (7), and decompose into 


(A-l) 


r 34 = r 31 + r 14 = A ^ for sim P licit y) 


(A- 2) 


For (r^/r,,) « 1 we can expand l/r.^ i nto a Taylor series around r 


*14' 31 
as shown: 


31 


34 [ r 34 • r 34] 


T7? - 7 ^ + r i4 ■ v - — 


\ ( r 14 • r 14) ^ 1 


[X . X ] 1 


in 


[*•*] L 

r 14 = ° 


(A- 3) 


where 


V = 

a 

*31 

and r 31 - 

- r 13 

1 

A - 

vX 

A • I 


r 


“|3 

[a . a] 

[*• 

A J 1 

A • XJ 


(A-4) 
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and 


14 


[* • *j 


172 


"l3 ' *-14 
3 

r 13 


(A-5) 


where 


I = unit diadic 
Similarly 



Combining (A-l), (A-3) , (A-5) and (A-6) and neglecting the first 
term of the series, l/r^* which makes no contribution to the equations 
of motion, we end up with the last term of Eq, (8) which is the expres- 
sion of (A-6), and represents the solar gradient force near the Earth. 
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Appendix B 

THE EXPRESSIONS FOR p(t) AND u(t) FROM LUNAR TffiORY 


The expression, for p(t) is readily obtained from Eq. (1), p. 281 

of Ref. 7, after computing (a/r) ^ * 1 + p(t) = r 1? and retaining only 

2 iZ 
terms of o(m ) or lower. The term -.00093 of our Eq. (15) corresponds 

12 -1 

to - -g m in the series for (a/r) . The semimajor axis a is set equal 

to the reference length D in our notation. 

Derivation of the expression for u(t) requires a few more alge- 
braic manipulations. We shall make use for this of Fig. 2 (p. 13) and 
Fig. 4 (p. 38) of Ref. 8, which are combined for convenience in Fig. 

B-l , and also Fig, B-2 which shows the lunar orbital plane as viewed 
from above (i.e., looking in the direction of the negative Z axis). 

In order to facilitate the derivation we shall retain (in this Appendix 
only) the notation and symbols of Ref. 8 irrespective of the use to 
which some of the letters have been put in the main body of the present 
report. Where necessary, the corresponding letters in our notation 
will be pointed out. 

In dimensional symbols we now have 


■ ■ ( n + 0 + lj M + ° 8in 1 (H x Hi) 

e 

Hi = cos Vx - sln Vy 

e J e 

T z X Hi = 8ln Vi + cos Vy 

e J e 

= nt + c - ft 


(B-l) 


so that 


= ^n + i z + cos + 0 sin i sin T^J i x 
e 

+ sin i cos 1^ - i sin i 


(B-2) 
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The dimensionless form of results if we set n = 1 in (B-2) . 

Noting that i x > *-y > are parallel respective to the unit vectors 
— — — e e e 

i x , iy* i- z u£ our centered coorindate trame , we have 
v x = fj sin i sin + i cos 71 0 

(B-3) 

Uy = fj sin cos - i sin 
3 

both of which are of o(m ) or higher. The expression for car he 

obtained by taking the time derivative of the true anomaly v in either 

one of the expressions on p. 110 of Ref. 8 or Eq. (2), p. 281 of Ref. 7. 

This results in the coefficient of i ^ of our expression (16) . 

With the aid of Fig. (B-l) it is also relatively straightforward 

to determine the components of r, _ in the i , i and 1 directions. 

r 13 x’ y z 

We refer the reader to pp. 38, 41, and 79 of Ref. 8 for a more de- 
tailed presentation of the relations sunmarized here. For convenience 
the following explanatory relations for the various angular arcs are 
sunmarized below. 

Ex or Ey = fixed reference line in ecliptic 
fl m 7 (t) = v 7 - fl where x Q s v Q « arc of nodal regression 
TD = x u + Q A (measured in two planes) = yEA 
c = y EM q (o) i.e., at t = o 
s = tan M 7 M 

v * xM 7 = ecliptic projection of xM 
i « M 7 n M 
y = tan i '=£ sin i 
Tl o = DM o = nt+ e -n 
c 7 = y Em 7 (o) at t = o if e g ^ o 

One can then show that 

cos Mn 7 = cos (v - v 7 ) cos M 7 M = ^1 - s^ + - ..•) cos (v - v 7 ) 

(B-4) 
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and since 

( 2 1 2 \ 12 . 
s - - e - g v j V sin + ey sin ... + "g e Y sin + ••• 

(B-5) 


(from p. 41 of Ref. 8). 
while 

£ -y ^ d? sin^ i ^ sin^ i Q = sin^ (5°8 , 43 # ) ^ .008 « 1 (B-6) 


we can approximate to sufficient accuracy 

cos rtn / cos (v - v 7 ) = cos [nt + c - n 7 t - e / 

+ e, e* times periodic terms] 

cos [ (1 - m)t + C - C 7 ] + higher order terms 


(we have divided by n = 1) (B-7) 

sin 2= - sin [(1 - m)t + c - c # ] + H.O.T. (B-8) 

Define: % = £ ttn' = (1 - m) t + c - e' (B-9) 

and note that 

sin m 7 T = sin i sin 0 m f = sin i sin (v* - Q) (B-10) 


With the above relations we can now obtain Eqs. (17) of the text 


y 3 

z 

s 


r 13 COS 5 

- r^ sin ^ (B-l 1) 

- r^ sin i s ^- n (y* “ D) * sin i sin (fl - v ') 
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Appendix C 

TAYLOR SERIES EXPANSION AROUND 

The steps needed in the expansion of the various terms in the La- 

4 

grangian L of Eq. (8) up to fourth order terms [i.e., o(m )] are indi- 
cated below. In dimensionless notation we have 

*14 = F 1L + 7 " here ? 1L * I (l + p) T x + 4 (1 + p > T y 
and thus 

r^ 4 = [| (i + p ) + 3c] 2 + (1 + p) + y] 2 + z 2 = ... algebra 

( 2 2 2 \ 
x + T^y + x + y + z J 

= 1+ (a + b) =1 + 1 (C-l) 


where a and b refer to the two terms following 1. 
This enables us to write r ^ in the form 


r\l = [1 + I ]- 1/2 


, Lj + h 2 . 11 i 3 + l- .Ii 4 + ... 
1 ‘ 2 1 + 8 1 58 1 16 8 1 


(C-2) 


A similar expression applies also to r*^ after replacing x by -x 


24 

in Eq. (C-l). 

Evaluate now the various terms in (C-2) . 


o(m 5 ) 


[i 2 ]: = /^+ 2ab + h 2 

/ 2 2 2 
2ab = 2p(2 + X + 73 y) (x + 73y + x + y + z ) 
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2 

= 4p^x + ,/3y + x 2 + y 2 + z 2 ^ + 2p(x + \/3y) + o(m 3 ) 

- 2p|^2(x + -J3y) + 3 x 2 + 5y 2 + 2z 2 + 2^3 xy] 

Terms independent of x, y, or z have been dropped since they don't con- 
tribute to the final D.E. 

b 2 = x 2 + 2^/3 xy + 3y 2 + 2(x + VSy) ^x 2 + y 2 + z 2 ^ 

+ (x 2 + y 2 ) 2 + z (x 2 + y 2 ) Z 2 + z* 

[i 3 ]: = ^ + ^ 2 b + 3ab 2 + b 3 

neglect as H.O.T. 

3ab 2 -* 6p(x + */3 y) 2 + o(m 3 ) 

b 3 -4 (x + V3y) 3 + 3(x + */3y) 2 (x 2 + y 2 + z 2 ^ + o(m 3 ) 

r 4i 4 

I : only b contributes 

b^ = (x + */5y)** + o(m^) 

Combining the above terms and neglecting noncontributing factors 

gives 


r"^ = - j [p(x + J5y) + (x + J5y + x 2 + y 2 + z 2 )] 

+ | {2p[2(x + V3y) + 3x 2 + 5y 2 + 2z 2 + 2^3 xy] 

+ [(x + J5y) 2 + 2(x + V3y)(x 2 + y 2 + z 2 ) + (x 2 + y 2 ) 
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+ 2 (* 2 + y2 ) * + * 4 * } - H { 6 P<* + V3y) 2 + (x + ./3y) 3 

+ 3(x + 7Iy) 2 (x 2 + y 2 + z 2 )} + 2|_ ( x + ./Jy) 4 (C-3) 

Furthermore 

(T + p)[j U + p) + x] = I (1 + p) 2 + (1 + p ) x 
(1 + p) 3 = 1 + 3p + ... 

I< 1 + 

The Lagragian L in 
it by D/C^ + Let 

set 


-1 -2 

p) + (1 + p) -* x - 2px + noncontributing terms 

(C-4) 

Eq. (8) is made dimensionless by multiplying 
us multiply Eq. (8) by this factor and then 


4 


Thus 


12 


“l + “2 = 1 
D = 1 

and introduce the dimensionless quantity 

- - ^ 

^ ' Pl + liR 

while from before we had defined already the quantity ^ 

4 " “l + “2 


(C-5) 


by Eq. (12). 
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It then follows that 


' M* - A 


Consider the contribution V—. of Earth and Moon to the potential 


energy term in L of Eq. (8) 


EM 


1 

- 1 

- 

p 


r 12 * T iu\ 

^ r !4 r 24 

+ p, 

I 

►-* CJ 
ro 


(C-7) 


We recall that only the x coordinate changes sign when we use the ex- 
pression for r to obtain r^ . A convenient expression for can 

be obtained by making in r~* the following substitution for all odd 
powers of x 


2n+l 2n+l 0 2n+l , 
x — x -2x (n = 0,1) 


When use is made of Eq . (C-6) and the lengthy algebraic manipula- 
tions are carried out, one ends up with a given by 


EM 


L 2 

l X " 

5 

8 

2 12 
y + 2 z 


(1 - 

2p) xy 



p(x 4 

,«>) f 
• ,%)} + { 

- 7 L 

16^ 

x 3 + 




1 - 2 
16 

Hi . 

„ 2 
33 xy - 

l -> 
16 

• 12 

, 2 
: xz 


Uy/3 21 

• -TT yz J 


r 37 

4 

123 2 2 

^ 3 

2 2 

. 33 

2 2 3 

4 3 4 


1128 

X 

vr x y 

+ If) 

x z 

+ u 

y z - 155 

y - 8 2 


¥ 

(1 

- 2|i) x 3 y 

_ 45 0 

¥r 

■ 2 U ) 2' 
xyz 

L 

f 

p 

3 

8 3 

2' 15 2 

< + r y 

3 2 

- 7 z 

♦! 

(1 - 

2p) xy) 


(c 
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The solar contribution V g to the potential energy term in L is 
found below. 


+ y i + z i 
s y s z 


13 


14 


[j a + p) 


x M4 


O + p) y e + y c y 


i + 


Now 


V 

s 




1 2 


(C-9) 


so that only terms of o(m ) or lower must be retained inside the bracket. 
After dropping all terms which do not contain the particle's coordinates 
we get 


( r 13 ’ r u) - ( x s x + V + jj) + ( x s + & y s)( X 


will lead to o(m ) terms 


(C-10) 


and 


2 

r 14 


x + J3y + 


2 

x 


+ y 


(c-u) 


Substitution of (C-10) and (C-ll) into (C-9) results in 
2 

- I(*x + y„y] 

2rf 


V s = -“pT K X « X + v) + ( X s + ^ y sX X s X + V + v)] 
" \ [^ X + ^ y ) + (* 2 + y 2 + 


(C-12) 
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The Hamiltonian 1! is defined by the relation 

— — T • 

H = p - r- L- pr-L 

t *■ 


(C-13) 


whe re 

- '14 • i ' '14 (C ' 1M 

I denotes the identity tensor. The equality p = r^ is a consequence 
of the linear dependence of r^, on the velocity components x,y,z, in 
the rotating coordinate frame. Writing r^ as 

'14 = '1L + ' r r + * x ' 

'lL = '1L T 1L + “ X 'lL 
' T r = iT x + * T y + ' T Z 


" 



?}L 


" r 14 




or, . 


dr 

L 14 




wc get 

• r j = ... algebra . . . 

- H p - ^ C 1 + •’> + p) + * + zu y - yn + v z> T x 
+ [4 '» + \ (1 + P^ 1 + 0 + * * X (’ + 0 ' z v] T y 
+ [4 (1 +■ P> r x P> « y + * + y^ x - x ”y] T z 

(C-15) 


We now introduce the momenta P via Eq. (19), solve for r from (C-14) 
and obtain the expressions 
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* = p x ' 4 + [4 (1 + p) + yl 1 + 0 • ? p - Zv y 

^ = P y + 5" i ^P'li' (1 + p) + X K 1 + 0 + zu x 

5 = P z + [j (1 + p) + xj v y - (1 + p) + yj v x (C-16) 


Also from Eq. (C-14) and Eq. (19) we can write L in the form 


1 (t . £) 2 + I fp + if + I P z . V 

2 \ x 2/ 2 \ y JJ 2 z V EM 


- V 


(C-17) 


If we now substitute (C-17) into (C-13) , make use of (C-14) and 
(C-16), and neglect all the terms which do not depend on the momenta 
P or the particle's position r we end up after a lot of algebra with 
the expression for the Hamiltonian H presented in Eq. (21) of the text. 
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Appendix D 

CANONICAL TRANSFORMATION TO SLOW VARIABLES 

We shall outline here the steps which underlie the canonical trans- 
formation from the variables to the slow set ar 7 ,R 7 . These variables 

are analogous to polar coordinates where corresponds to an amplitude 
and p to a phase shift. We shall find it convenient to use also a car- 
tesian set of generalized coordinates q,p in terms of which the trans- 
formation relations will be developed. 

Let 


s = S(q.p') = S L + S 2 


(D-l) 


be a generating function from the set q,p to a second slowly varying 
set q 7 , p 7 where will be selected to remove from H the 3rd order 
terms (all of which are short period) and to remove all 4th order 
short period, and define 


S 

q 


[ as as as 1 

*r a v 


(1 x 3) row matrix of partial 
derivatives of S 


S T = (3x1) column matrix of partial derivatives 

q 


Then 


p = p 7 + S T (q iP 7 ) 

q 

q = q' - s (q,p 7 ) 
p 


Let 


Aq « q - q 7 

Ap = p - p 7 


(D-2) 


(D-3) 
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and expand S T and S in a Taylor series around the values of q^p'. 
To second orcler in and Ap 


s x(q*p f ) - — m 

q aq 


S(q\p 7 ) + S 


»aq + i Aq T S n , 

1 q q 


Aq + 




(D-4) 


The expressions for q = q(q',p') and p = p(q',p') can then be de- 
ve loped via (D-2) and (D-4) 


q " s /T^P 7 ) - q 7 - t — — Is (q 7 ,p 7 ) + S /Aq + j Aq T S T 
p d T L q 2 q q 


Aq + 


* q* - s /rCq^p 7 ) - s (q / iP / )Aq + 


/*r / 

p A q 


(D-5) 


In .order to prevent carrying unnecessary terms along let us esti- 
mate the order of magnitude of the above terms. Since S will be used 
to perform a transformation of variables in H 7 which contains terms of 

o(m 3 ) and o(m^) then the lowest terms in S will be of o(m^). We might 
3 

also use the notation o(x ) since the x,y,z coordinates are the ones 
to be transformed. 

Let us view q as equivalent to 3 and p as equivalent to a. 

Then the derivatives result in the following orders of magnitude 

S - o(H 3 ) - o(x 3 ) = o(ar 3/2 ) 

S -* S = o(a 1/2 ) = o(x) 

P “ (D-6) 

S q •* S p = o(o 3/2 ) = o(x 3 ) 

s T “* o(S ) = o(x) 
p q P 


We assume also that 
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Aq a; o(x) + higher order terms 


(D-7) 


T 5 

u e thus note that the term Aq S ™ , iq = «(* ) and after operating 

r, q q 

on it with 3/?)^/ it becomes o(x J ) . 

Equation (D-5) can then be written as 


Aq=-S /x-S /T/ Aq+o(x) ••• 
P P q 


(D-8) 


3 2 

and terms of o(x ) are not carried along. Thus to o(x ) we can write 

the following relation 


[i + s n ,] m * - s 

L n n J D 


p q 


which can be inverted to solve for Aq 

*--[ I + s <r/] ls / T ^- s 'T + s 'T' s 'T + °<* 3 > 
L n a -> n 1 n A p q p A 


(D-9) 


where I is the identity matrix. 

From Eq. (D-2) we also note that 

Ap ta o(S •»») = o(x^) + H.O.T. 

q 

Expanding for Ap as was done in (D-9) for Aq we find 

Ap - S - S n , S + o(x 5 ) (D-10) 

q 1 q q p 

The partial derivatives of H can also be treated similarly to the 
partials of S. Thus 
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H = o(H) 

q 


H P " 


For a scleronomic generating function S we have the transformation 


relation for Hamiltonians 


K(q'.p') - H(q,p) 


and expanding H in a Taylor series around q / ,p / gives 


K(q / »P # ) = iKq'.p') + H q / Aq + H p/ A p + 


■ + Ap T H(q',p') + 


= HCq'.p') + H q ,Aq + H p , 


+ i - V + V 


+ 2 - V + %' 


S »T + S /T ' 
p r p q 


I® /T " ® /T * 
L q^ q q 
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The subscript letters A,B,C,D have been introduced merely for 
ease of subsequent identification of the respective brackets f 1 . 
We recall c'nac 


H = + H 0 + H. 

3 4 

o(x 2 ) o(x 3 ) o(x 4 ) 


(D-14) 


3 4 

where H 3 contains x terms and denotes the 4th order terms like x , 

m^x^,px^ etc. This breakdown into and will be made use of in 

choosing the relations defining S.(q / ,p / ) and S_(q / > p / ). 

* 3 ^ 

We shall assume to contain only o(x ) terms and only terms 
of o(x^) , because we shall select so as to remove all 4th order 
short period terms from the Hamiltonian K. 

^ \ " " [ Sl p rr + \n ( Sl p *v + S'*,') (V + ’V)] 


- s - s + s s + s s + s s 
P'1 p'T l p m q ' \rr %rr \n n * 


o(x^) 


p A q p 


o (x) o (x ) 


o(x 2 ) 


o(x 3 ) 


o(x 3 ) 


(D-15) 


Thus to o(x ), which is the highest order retained in all terms, 


V r "^a "* 


V [- 


! 1 * S 2 + S 1 

p ' 1 P*» P^q 


•VK'hd 


(D-16) 


Similarly, the following expressions can be derived 

n * 


S 1 + S 2 

q* q* 


S. S. + H.O.T. 

1 rr * * rr 
q q p 


CD-17) 
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+ H- 


+ H.O 


\ C ] C H q T q ' [ ] A - 2 S l "^V S l 


q q ‘n't 




p q a p q*r 


\ [ 1, H [ ]. -* - j S H { ^ ,S 

2 D pHT,' A Z 1 , p-Tq „ 




Substituting (D-16) and (D-18) through (D-22) into (D-13) 
in the expression 


K -H (0) +H 3 + H 4 + H^|- s -8 2 JW +S 1 


■?’[- 


a'T n'T 


p'V^p'J 


- H, S, + H<“>[ Sl + s - Sl S 1 + H 

V V » [ V* q* q*V P^J * 


+ is, -w A 


2 i rr / i 


q V p't 


+ i S s 

? q'P P q* 1 


.T. 

(D-18) 

(D-19) 

(D-20) 

(D-21) 

(D-22) 

results 



(D-23) 
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Recalling the definition of the Poisson bracket 

[H.S] = H q S T - H p S T (D-24) 

and applying it to the terms of Eq. (D-23) one can obtain after a 
lengthy series of manipulations and combinations of terms the expres- 
sion 


K = H (o) + H 3 + H 4 - [h (o) ,S 1 ] - [h (< 5) ,S 2 ] - [Hj.sJ 

+ \ [H <0) ' S l q/ S l ^ + ?[[ H(0) ' S l]* S l] 


(D-25) 


Let us choose for the definition of the relation 


[h<°\ S i ] - H 3 = 0 


(D-26) 


and thereby remove which contains only short period terms. 
Then 


K = H< 0) + + 1 + J [h 3 . Si ] - [h 3>Si ] 

- [h <0) .S 2 ] = H<°> + H, + \ 

- ? [ H 3* S lJ - [ h(<>) ’ S 2] (D - 27) 


The third and fifth final terms in Eq. (D-27) can be combined 
into the one bracket 
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1 

2 



(D-28) 


We now define S 2 such as to remove all remaining 4th order short 
period terms from K. Now both H 4 and [H^S^ will contain both long 
period ( ) and short period (s«p) terms, which can all be eliminated 
by letting S 2 be defined via 



This leaves the long period form of the Hamiltonian K as 


K = H (o) + H 4 - I [Hj.sJ (D-30) 

and if a 7 and 0* are selected as the canonical variables, rather than 
a' and (t + 0 7 ), the long period perturbation Hamiltonian K 7 is obtained 
as 



(D-31) 


To this expression one must still add the contribution from the 
linear forced solutions x,y due to as indicated in Eq. (49) which 

then finally leads to the relation presented in Eq. (48). 

Comparison of the K from Eq. (D-31) with the K presented on p. 63 
of Ref. 6‘ shows that the two Hamiltonians are not alike. Uiis differ- 
ence can be traced to the particular way in which the time dependent 
generating function S^(q,p 7 ,t) of Ref. 6 was defined there by means of 
an equation in the mixed variables q,p 7 , instead of first carrying out 
the transformation to the new set of coordinates q 7 ,p 7 shown in this 
appendix in Eqs. (D-9) to (D-13) . 

As a consequence of the use of mixed variables , some of the terms 
which would have appeared from the additional Taylor series expansion 
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over q were thus missing and only one half of the terms of the Poisson 
bracket [H^,S^] of Eq. (D-31) showed up in the function $ introduced 
in Ref. C. Tue absence of these additional terms prevented the cancel- 
lation of nonpolynomial terms (i.e., terms which do not arise from 
binomial expansions such as (x + y) n , where n is some finite integer) 
and led to the presence of an extraneous term such as the 
term in Eq. (10) of Ref. 6. 

The source of the incorrect results, which can arise when one oper- 
ates with mixed variables anytime terras higher than of first order are 
retained in the Hamiltonian, were recognized by Prof. Breakwell, who 
then suggested that the correct procedure in choosing the function 
would be to transform first to the new set of coordinates q 7 ,p 7 . The 
implementation of this suggestion led to the developments presented in 
this appendix, and avoided here the presence of the inadmissible non- 
polynomial terms. 

The derivation of the Mathieu type Hamiltonian in Appendix F does 
make use of mixed variables. However, the results obtained there are 
correct since only linear terms were retained in H. 

A last comment should be made regarding the slow variables q 7 ,p 7 , 
or <* 7 ,0 7 . It turns out that it is impossible to prevent the presence 


of some higher order long period terms in S£ which arise because the 

term Sj /Si ^ may contain also long period parts. From this it fol- 

q p T 

lows that in the expression for, say, q 


the last two terms may also make some long period contributions to q, 
which would tend to contradict the assertion that q 7 (and also p 7 ) are 
the only long period variables. This situation is unfortunately un- 
avoidable and cannot be circumvented by redefining or S£ , since the 
elimination of the extra long period terms in q 7 or p 7 via S would 
automatically result in the introduction of unwanted higher order short 
period terms into K that S would be incapable of suppressing simultaneously. 
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Fortunately this impasse is not too serious since the bothersome 
long period terms in Eq. (D-32) are of o(m 4 ) or higher and may be 
safely disregarded within the extent of the present theory inasmuch 
as q 7 does not appear in a linear manner in H. They would pose a prob- 
lem however if the present approach were to be extended to encompass 
some of the higher order terms currently neglected. 
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Appendix E 

SOME ILLUSTRATIVE STEPS IN THE DERIVATION OF 
T.ONG PERIOD TERMS IN E 7 

The steps leading from Eq. (48) to Eq. (50) required by far the 
most time consuming, tedious and exacting manipulations and computa- 
tions of the whole investigation. We shall indicate here only briefly 
as an example a few representative intermediate steps so as to pro- 
vide the reader with a feeling for what is involved here. 

First a general remark concerning the Poisson bracket 
In the expanded form, and using the polar canonical variables & and 
8, this becomes 


[ H 3 ,S l] H 3P.' S l(y' ' H 3cy 


(E-l) 


where the tensor notation for summation over i = 1,2,3 has been used. 
The same bracket, when and are expressed in cartesian coordinates 
x,y ,z,P x ,Py , can also be written as 


[ H 3> S ll 


H 3x S 1P 


VlP 


- «3P S 1x 


»3P S 


lz 


(E-2) 


which indicates that the bracket will give rise only to polynomial 
2 2 3 4 

terms of the form x P , x P , y , etc. 

z y * J * 

From this it follows that when one evaluates the long period terms 
in the polar coordinates used in Eq. (E-l) one must be careful to ob- 
serve that only polynomial type terms should be retained. Thus, one 
can obtain secular terras like , 7c^ ••• etc. or slowly varying 

terms like (•••) a'a' cos f - o^t + •••] or (*••) 
cos [(u^ - 3u^)t + •••], but not terms such as (••■) a *3/ 2 a / l/2 x 

cos [(u^ - 3u^)t + ■ • • ] because such a term could not arise from products 
3 3 

of the form x^y 2 or y^x 2 which are the only kind that could give rise 
to long period trigonometric terms with a frequency - 3u^ . The 
quantities x^,y 2 , etc. represent the term in x and the term in 
y of Eq. (36), respectively. 
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This polynomial requirement is not satisfied in Eq. (10) of 
Ref. (6) which contains the term cos [,0609t + 

+ 311^ - 4.48°]. 

We shall indicate now a few steps in the evaluation of one of 
the long period terms in the Poisson bracket. For convenience we let 


H 3 ’ H 32 


33 


where 

**32 = co P^ anar ( x >y) terms in 
H33 = °ut-of-plane (2) terms in 

Similarly 


S 12 + S 13 


(E-3) 


(E-4) 


where 



(E-5) 


from Appendix D. (We recall that [H,Sl = - dS lot when H is treated 
as the momentum conjugate to the coordinate t.) 

Then 


H 3’ S 1. " l H 32 + H 33* S 12 + S 13. 


* [Vl 2 ] + [ H 32* S 1 3 ] + [ H 33> S 1 2 ] + [ H 33- S l 3 ] (E - 6) 


Let us take the first bracket 
only the component in 
the product of the two parts 


in Eq. (E-6) , and consider for example 
it. It can be shown that it arises from 
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**32^ ‘ ^ II M < a l + a 2> + I N(33a 3 - ? V} A 

+ a\' 2 {M(bj + b 2 ) + N(33b 3 - 7b 4 )}^ 

+ «- 1/ V|iM(c 1 + c 2 ) + 5 - N(33c 3 - 7c 4 )|^ (E- 6) 

and 

S 12 Pl - - “I' 2 ( M < a l' + a 2> + N(33 “3 - ? < ) } D 

- « 1 »2 /2 { M(b i' + b 2> + N(33b 3 " 7b 4 ) ) E 

- a l /2 "2{ M(c l' + C P + N(33c 3 " 7 c 4 ) } f 

where we have dropped for convenience the primes on the oc’s and 6 s. 
The quantities ••• a^, ••• b^, c^ * * * are defined 

in terms of x = A 1 C (1) + A 2 C (2) and * “ A 1 C (1)+S 1 + A 2 C (2)+6 2 ’ WherC 
(1) - (2) r ^ and A X ,A' ... 6^ are obtained from Eq. (36). 

H and N are two constants defined as M - lv/3/16 and N = (1 - 2p,)/16. 

In terms of the above constants one can obtain the following ex- 
pressions 

a l = a l ‘ I A l A l( C (l)+6 1 + \ C 3(l)+6 1 + ? C U)-6 1 ) 

1 " “l “l 1 

C 2(l)+(2)+6 1 + 2^ - ^ C 2(l)-(2)+6 1 J 
+ I A 1 A 2 j^u.^ m 2 C 2(l)+(2)+6 2 + 2^ - ^ C 2(l) -(2) -fij 

(E-7) 


332 


THREE-DIMENSIONAL, NON-LINEAR RESONANCES 


and similar expressions for all the other quantities inside the ( ) 
brackets . 

We observe from Eq. (E-6) that in the Poisson bracket, the coef- 
2 

ficient of would arise from the product of brackets f } A with [ 
and in the same manner we note that the 


coefficient of *♦ results from { ] A « [ } p ; f }g« { 1 £ ; { } c *f ] D 

«} /2 or l' 2 -* results free ( }„•{ } F ; f }<.- { } £ 


□'2 *♦ results from { } c »{ } p 


(E-8) 


Products of brackets { } as indicated in Eq. (E-8) arise in all 
the partial derivative products of with S^, and must be sumned up 
for every combination ot n ot m to obtain the final value of the coefficient 

1 J 

for that particular combination of or's. 

2 


have 


For example, to obtain the coefficient of ar^ in ^123 


V - { } A f = - I { M(3 1 + a 2 } + H(33a J - 7a 4>} 

• |M(a' + a') + N(33a' - 73^)} (E-9) 

where the following relations among the a's apply here 

(E-10) 

Expanding and using the appropriate relations for the a's (not 
shown here) gives 
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°V ■ + a 2 )2 + 3MN(a l + a 2 )(33a 3 - 7a 4 ) + |N 2 (33a 3 - 7a 4 > 2 } 

(E-ll) 


| M 2 (a 1 + a 2 ) 2 = | M 2 (a 2 + 23^ + a 2 ) 


-1^ 


1 a 4 a ' 2 / 3 a 1 r \ . 1 A 2 A' 4 f 3 + r \ + 1 a' 6 . 

ZT A 1 A 1 \C + 2 c 26 J + 4 A 1 A 1 \l + C 2 6l j + 17 A 1 


(E-12) 


3MN(a 1 + a 2 )(33a 3 - 7a 4 ) = 3MN 


- 7 


r e t| *!',,)•! Vi 5 ’ ” 


(F.-13) 


I N 2 (33a 3 - 7a 4 ) 2 = | 


N 2 ^089 i|A 2 A' 4 (| + \ C 2fi J| - 46 2 jl A 4 a( 2 | + 

(E14) 


When the numerical values for A^,A*,A 2 ,A 2 ,C^,C26^, etc. are sub- 
stituted into Eqs. (E-12) through (E-14) and all the terms added, one 
obtains the result 


-92.871a 2 


(E-15) 


This s ame , or a similar, procedure must be repeated for every 
combination of oAs which arises from all the terms of the Poisson 
bracket . 
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Appendix F 


MATHIEU TYPE HAMILTONIANS 


Consider the near-resonant Mathieu equation 


X + u?£l + T, cos 2u, Q t (1 + c)Jx = 0 (F-l) 

where T|, €, « 1. The effect of the trigonometric coefficient is to 
introduce a parametric excitation term into the simple harmonic oscil- 
lator model. 

It is easily seen that as e -* 0 a resonant forcing term will arise 
in case a perturbation solution is attempted, after the solution 

to the equation 


X + a) X = 0 (F- 

o 

is substituted back into Eq. (F-l) to provide the next higher terra. 
The Hamiltonian of system (F-l) is 


H - I [p 2 + .V] - H<°> + H' 


id = a) 1 + T1 cos 2 uj t(l + c) 
oL o 


X,p = generalized coordinate and momentum, respectively 

= Hamiltonian of simple harmonic oscillator of frequency 
H / = perturbation Hamiltonian (for 71 « 1) 

The solution corresponding only to H^°^ is 


x (o> . js sin „ (t + 


(t + B) 
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where a, 3 a *e constants of integration. 

When H # is included, or and $ become functions of time t. It is 
n fill t-n ormcider isainly the lcng period variations in u and ", since 
these basically tell us the most about the "averaged" long term behavior 
of the system. 

The canonical transformations of variables shown next serve the 
purpose of suppressing all short period terms in H. We assume that or 
and £ can be decomposed into short period and long period (or 7 ,0 7 ) com- 
ponents, i.e.. 


o at + or 


s.p. 


p -* p + p, 


s.p. 


Introducing into the Hamiltonian H 7 


H 7 = j u^T|X^ cos 2uu Q t (1 + e) 


(F-5) 


and rearranging terms gives 

H 7 = — ^ {cos 2w o t(l + O - ^ cos {2u) Q t(l + e) + 2u> o (t + P) j 

- j cos 2iu o (ft - p)| (F-6) 

The last term with angular velocity 2u) o f « 2(1)^ is of low frequency 
and thus gives a contribution to the long period part of H 7 . 

Let this term be designated by H 7 

H 7 = - cos 2iL Q (ct - ^) (F-7) 

Note that a still contains s.p. terms so that H 7 still is not the 
final form of the desired long period Hamiltonian. 
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To obtain the D.E. for a r 7 we introduce a generating function 
S(a / ,0,t) of the Hamilton- Jacobi equation which, to first order terms 
can be written in the form 


s = a'e + ip^c/.S.t) 


(F-8) 


in terms of the new momenta a and the old coordinates Thus 


Of 


as 

90 



0 


9S 

9<y T 


= e + n — ; 

da 


(F-9) 


relate the old and new coordinates and momenta. a / ,0 / form a canonical 
set with respect to a long period Hamiltonian K 7 , such that 


where 


b' 


_ dK/ 
90 7 
9k7 
3c/ 


(F-10) 


K'-H' + f 


(F-ll) 


The Hamiltonian K 7 of Eq. (F-ll) is treated as a function of the 
coordinates c/,0 7 and t, after the transformation relations (F-9) are 
substituted into the right hand side of (F-ll). 

To linear terms only 

H'(ff,e.t) = h'(«' + TP lfl .e,t) 2= + H.O.T. (F-12) 


and thus 
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K 7 ^ H 7 (a 7 , 6,t) + H 7 (c/,0,t) + T]S + H.O.T. (F-13) 
sp 1 . 


The function is chosen in such a way as to eliminate all s.p. 
terms from (F-13) . We thus require 


T1S 1 + = 0 


(F-14) 


from which results 


S, = o ... ^ sin 2 U u o (2t + ft + 3) - CiV sin 2 V (1 + C) 

o v 

(F -15) 


1 8o) (2 + €) 


Expanding 0 around fl 7 in h' of Eq. (F-13) gives, again to first 

order 


K 7 = H 7 (o 7 ,0 ,t) 


- ^ cos 2 oj (et - p 7 ) 

4 O 


(F-16) 


with the aid of Eq. (F-7) . 

Equation (F-16) defines a long period, time dependent, Mathieu 
type Hamiltonian. 


Stability Analysis 

The differential equations for a 7 and p 7 are summarized by the 
matrix equation 


- 4 


[ d/flfl' 

i S/3a' 


(F-17) 


Rather than 3 olve Eq. (F-17) directly for a ' and ? ' it is more 

* * / \ 

convenient to introduce a further generating functions S (a ,0 >w 
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so as to eliminate the explicit dependence on time and thereby trans- 
/ * 

form K to a new Hamiltonian K which is a constant of the motion, i.e., 
* * 

K = K = constant. 

° * 

Let us define a variable p by the relation 

8* = B' - ft (F-18) 

* 

and then take S to be given by 

S* = a* (e' - ft) (F-19) 


Since 


■k t * 

s * = 0 - Ct - p 


and 


S 

B 


* 

<y 


the two variables a and p are canonically related to the new Hamil- 
* 

tonian K which becomes 


* / * / * q< Tl * 

K «* K + = K - €a = - ^r 11 cos 2u) B - 

t 4 0 


That K is an integral constant of the motion is 
fact that 


dK 

dt 


* * y 

• ★ _ bk •* x •* . ar 

= K = — + — 8 + fr 

ta ae f 

* 

= by Hamilton's equation = — * 
do 


($) 


* * , 

K = K - constant of the motion 


fcr* (F-20) 

evident from the 
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The stability or instability of systems governed by Hamiltonians 
of the form (F-20) can readily be established based on a comparison 
of the magnitude of the cocfIiv.i.t;iiLi» vf ur <*uu ur ^<d 0 p xu V r -^ > • 

The relative magnitude required of these coefficients for insta- 
bility or stability to exist can be determined as shown below, and the 
conclusions then checked by referring to the known stability regions 
of the Mathieu plane . 

* 

From Eq. (F-20) we obtain the differential equations for a 

* 

o' T| * 

°- “ - sin 2 id 3 (F-21) 

2 o 

and squaring. 



.*2 

1 _ 

4 * 

V -I 

4c * 

2 



* ° 

o' T| 

h ~1T~ a 

a *n 



(F-22) 


after sin^ 2 it B* is replaced from Eq. (F-20) and the constancy of K 
o 

is made use of. 

The condition necessary for 6t to vanish is obtained by setting 
the right hand side of (F-22) equal to zero, i.e., at the intersection 
of the two lines. 


and 


± Of 


4 * 4f * 

— K + — a 

ti o n 


(F-23) 


This is shown in the next sketch. Fig. (F-l) . 

From this sketch we see that for Qf 0 > Qf cr and 6f Q > 0 the variation 
of a* is bounded by the lines y = ± a* if 4 c/T| > 1, thus implying a 
stable motion, while if 4c/7| < 1, a grows without limit. 

Hence , if 
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4c 

11 


< i 


0 AT POINTS® 


Fig. (F-l): Stability Conditions for Mathieu 

Type Hamiltonians 


— > 1 stability exists 

(l.e., x « bounded) 

4c 

— < 1 -* instability exists 

(i.e., x oo as t *♦ ®) 

This leads to the conclusion that the motion is unstable if in the 
* * 

Hamiltonian K the coefficient of a is smaller than the coefficient 
* * 
of o' cos 2 oj q P . 

The above conclusion is also borne out by considering the Mathieu 
Equation in the standard form, 

d 2 v 

— -it + (a - 2q cos 2z)v = 0 (F-24) 

dz 


Referring to Eq. (F-l) and introducing a new time variable t 
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t = w t(i + e) + 


and 


(F-25) 


Equation (F-l) reduces to (F-24) if 


(1 + O 


and 


(F-26) 


2(1 + C) 


2 2 


The stability boundaries of the Mathieu plane (q,a) in the vicin- 
ity of the region a 1 are shown below (see for instance p. 114 of 
Ref. 11). 





+ . . 


\ 


PERIODIC SOLUTION 
BOUNDARIES WHICH C* N 
flt A ppKOttMATED BY 
STRAIGHT LINES NEAR 
THE POINT (0,0 


Fig. (F-2) : Stability Boundaries in Mathieu Plane (q,a) 

On se 1 the slope (da/dq)^ 0 -1. 

The slope of line N - N 7 is, from Eq. (F-26), (da/dq)^ _^/ ~ 2/t 1, 
and q - T|/2 for a - 1 (pt. 1). 
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As the value of (a) changes from 1 to 1/(1 +6 ) 2 ~ 1 - 2« + 3e 2 , point 1 trans- 
lates along line N - N' to either point 2 or 3, depending on whether the solution remains 
unstable (pt. 2) or enters the stable region (pt. 3). 

Comparing the values of q on se. and N-N' for the same change Aa ~ _ 2f + H . O. T. 
we have 


On se^ 



Aa = (-1) (-26) = 2e 
q=0 


On 


N-N ' 

N 2 \da/ 


Aa = - - 
N-N 


H.O.T. 

Hence, if q N > q^, i.e., ifij/2 ^ > 2« , or 4e/T} < 1, point 1 moves 
and indicates as unstable solution. This is in aggreement with the conclusion 
earlier via the Hamiltonian approach. 


to point 2 
reached 
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